Cargando…

Network and Data Integration for Biomarker Signature Discovery via Network Smoothed T-Statistics

Predictive, stable and interpretable gene signatures are generally seen as an important step towards a better personalized medicine. During the last decade various methods have been proposed for that purpose. However, one important obstacle for making gene signatures a standard tool in clinics is th...

Descripción completa

Detalles Bibliográficos
Autores principales: Cun, Yupeng, Fröhlich, Holger
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3760887/
https://www.ncbi.nlm.nih.gov/pubmed/24019896
http://dx.doi.org/10.1371/journal.pone.0073074
_version_ 1782282813856284672
author Cun, Yupeng
Fröhlich, Holger
author_facet Cun, Yupeng
Fröhlich, Holger
author_sort Cun, Yupeng
collection PubMed
description Predictive, stable and interpretable gene signatures are generally seen as an important step towards a better personalized medicine. During the last decade various methods have been proposed for that purpose. However, one important obstacle for making gene signatures a standard tool in clinics is the typical low reproducibility of signatures combined with the difficulty to achieve a clear biological interpretation. For that purpose in the last years there has been a growing interest in approaches that try to integrate information from molecular interaction networks. We here propose a technique that integrates network information as well as different kinds of experimental data (here exemplified by mRNA and miRNA expression) into one classifier. This is done by smoothing t-statistics of individual genes or miRNAs over the structure of a combined protein-protein interaction (PPI) and miRNA-target gene network. A permutation test is conducted to select features in a highly consistent manner, and subsequently a Support Vector Machine (SVM) classifier is trained. Compared to several other competing methods our algorithm reveals an overall better prediction performance for early versus late disease relapse and a higher signature stability. Moreover, obtained gene lists can be clearly associated to biological knowledge, such as known disease genes and KEGG pathways. We demonstrate that our data integration strategy can improve classification performance compared to using a single data source only. Our method, called stSVM, is available in R-package netClass on CRAN (http://cran.r-project.org).
format Online
Article
Text
id pubmed-3760887
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-37608872013-09-09 Network and Data Integration for Biomarker Signature Discovery via Network Smoothed T-Statistics Cun, Yupeng Fröhlich, Holger PLoS One Research Article Predictive, stable and interpretable gene signatures are generally seen as an important step towards a better personalized medicine. During the last decade various methods have been proposed for that purpose. However, one important obstacle for making gene signatures a standard tool in clinics is the typical low reproducibility of signatures combined with the difficulty to achieve a clear biological interpretation. For that purpose in the last years there has been a growing interest in approaches that try to integrate information from molecular interaction networks. We here propose a technique that integrates network information as well as different kinds of experimental data (here exemplified by mRNA and miRNA expression) into one classifier. This is done by smoothing t-statistics of individual genes or miRNAs over the structure of a combined protein-protein interaction (PPI) and miRNA-target gene network. A permutation test is conducted to select features in a highly consistent manner, and subsequently a Support Vector Machine (SVM) classifier is trained. Compared to several other competing methods our algorithm reveals an overall better prediction performance for early versus late disease relapse and a higher signature stability. Moreover, obtained gene lists can be clearly associated to biological knowledge, such as known disease genes and KEGG pathways. We demonstrate that our data integration strategy can improve classification performance compared to using a single data source only. Our method, called stSVM, is available in R-package netClass on CRAN (http://cran.r-project.org). Public Library of Science 2013-09-03 /pmc/articles/PMC3760887/ /pubmed/24019896 http://dx.doi.org/10.1371/journal.pone.0073074 Text en © 2013 Cun, Fröhlich http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Cun, Yupeng
Fröhlich, Holger
Network and Data Integration for Biomarker Signature Discovery via Network Smoothed T-Statistics
title Network and Data Integration for Biomarker Signature Discovery via Network Smoothed T-Statistics
title_full Network and Data Integration for Biomarker Signature Discovery via Network Smoothed T-Statistics
title_fullStr Network and Data Integration for Biomarker Signature Discovery via Network Smoothed T-Statistics
title_full_unstemmed Network and Data Integration for Biomarker Signature Discovery via Network Smoothed T-Statistics
title_short Network and Data Integration for Biomarker Signature Discovery via Network Smoothed T-Statistics
title_sort network and data integration for biomarker signature discovery via network smoothed t-statistics
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3760887/
https://www.ncbi.nlm.nih.gov/pubmed/24019896
http://dx.doi.org/10.1371/journal.pone.0073074
work_keys_str_mv AT cunyupeng networkanddataintegrationforbiomarkersignaturediscoveryvianetworksmoothedtstatistics
AT frohlichholger networkanddataintegrationforbiomarkersignaturediscoveryvianetworksmoothedtstatistics