Cargando…

netDx: Software for building interpretable patient classifiers by multi-'omic data integration using patient similarity networks

Patient classification based on clinical and genomic data will further the goal of precision medicine. Interpretability is of particular relevance for models based on genomic data, where sample sizes are relatively small (in the hundreds), increasing overfitting risk netDx is a machine learning meth...

Descripción completa

Detalles Bibliográficos
Autores principales: Pai, Shraddha, Weber, Philipp, Isserlin, Ruth, Kaka, Hussam, Hui, Shirley, Shah, Muhammad Ahmad, Giudice, Luca, Giugno, Rosalba, Nøhr, Anne Krogh, Baumbach, Jan, Bader, Gary D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: F1000 Research Limited 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7883323/
https://www.ncbi.nlm.nih.gov/pubmed/33628435
http://dx.doi.org/10.12688/f1000research.26429.2
_version_ 1783651194751680512
author Pai, Shraddha
Weber, Philipp
Isserlin, Ruth
Kaka, Hussam
Hui, Shirley
Shah, Muhammad Ahmad
Giudice, Luca
Giugno, Rosalba
Nøhr, Anne Krogh
Baumbach, Jan
Bader, Gary D.
author_facet Pai, Shraddha
Weber, Philipp
Isserlin, Ruth
Kaka, Hussam
Hui, Shirley
Shah, Muhammad Ahmad
Giudice, Luca
Giugno, Rosalba
Nøhr, Anne Krogh
Baumbach, Jan
Bader, Gary D.
author_sort Pai, Shraddha
collection PubMed
description Patient classification based on clinical and genomic data will further the goal of precision medicine. Interpretability is of particular relevance for models based on genomic data, where sample sizes are relatively small (in the hundreds), increasing overfitting risk netDx is a machine learning method to integrate multi-modal patient data and build a patient classifier. Patient data are converted into networks of patient similarity, which is intuitive to clinicians who also use patient similarity for medical diagnosis. Features passing selection are integrated, and new patients are assigned to the class with the greatest profile similarity. netDx has excellent performance, outperforming most machine-learning methods in binary cancer survival prediction. It handles missing data – a common problem in real-world data – without requiring imputation. netDx also has excellent interpretability, with native support to group genes into pathways for mechanistic insight into predictive features. The netDx Bioconductor package provides multiple workflows for users to build custom patient classifiers. It provides turnkey functions for one-step predictor generation from multi-modal data, including feature selection over multiple train/test data splits. Workflows offer versatility with custom feature design, choice of similarity metric; speed is improved by parallel execution. Built-in functions and examples allow users to compute model performance metrics such as AUROC, AUPR, and accuracy. netDx uses RCy3 to visualize top-scoring pathways and the final integrated patient network in Cytoscape. Advanced users can build more complex predictor designs with functional building blocks used in the default design. Finally, the netDx Bioconductor package provides a novel workflow for pathway-based patient classification from sparse genetic data.
format Online
Article
Text
id pubmed-7883323
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher F1000 Research Limited
record_format MEDLINE/PubMed
spelling pubmed-78833232021-02-23 netDx: Software for building interpretable patient classifiers by multi-'omic data integration using patient similarity networks Pai, Shraddha Weber, Philipp Isserlin, Ruth Kaka, Hussam Hui, Shirley Shah, Muhammad Ahmad Giudice, Luca Giugno, Rosalba Nøhr, Anne Krogh Baumbach, Jan Bader, Gary D. F1000Res Software Tool Article Patient classification based on clinical and genomic data will further the goal of precision medicine. Interpretability is of particular relevance for models based on genomic data, where sample sizes are relatively small (in the hundreds), increasing overfitting risk netDx is a machine learning method to integrate multi-modal patient data and build a patient classifier. Patient data are converted into networks of patient similarity, which is intuitive to clinicians who also use patient similarity for medical diagnosis. Features passing selection are integrated, and new patients are assigned to the class with the greatest profile similarity. netDx has excellent performance, outperforming most machine-learning methods in binary cancer survival prediction. It handles missing data – a common problem in real-world data – without requiring imputation. netDx also has excellent interpretability, with native support to group genes into pathways for mechanistic insight into predictive features. The netDx Bioconductor package provides multiple workflows for users to build custom patient classifiers. It provides turnkey functions for one-step predictor generation from multi-modal data, including feature selection over multiple train/test data splits. Workflows offer versatility with custom feature design, choice of similarity metric; speed is improved by parallel execution. Built-in functions and examples allow users to compute model performance metrics such as AUROC, AUPR, and accuracy. netDx uses RCy3 to visualize top-scoring pathways and the final integrated patient network in Cytoscape. Advanced users can build more complex predictor designs with functional building blocks used in the default design. Finally, the netDx Bioconductor package provides a novel workflow for pathway-based patient classification from sparse genetic data. F1000 Research Limited 2021-01-22 /pmc/articles/PMC7883323/ /pubmed/33628435 http://dx.doi.org/10.12688/f1000research.26429.2 Text en Copyright: © 2021 Pai S et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software Tool Article
Pai, Shraddha
Weber, Philipp
Isserlin, Ruth
Kaka, Hussam
Hui, Shirley
Shah, Muhammad Ahmad
Giudice, Luca
Giugno, Rosalba
Nøhr, Anne Krogh
Baumbach, Jan
Bader, Gary D.
netDx: Software for building interpretable patient classifiers by multi-'omic data integration using patient similarity networks
title netDx: Software for building interpretable patient classifiers by multi-'omic data integration using patient similarity networks
title_full netDx: Software for building interpretable patient classifiers by multi-'omic data integration using patient similarity networks
title_fullStr netDx: Software for building interpretable patient classifiers by multi-'omic data integration using patient similarity networks
title_full_unstemmed netDx: Software for building interpretable patient classifiers by multi-'omic data integration using patient similarity networks
title_short netDx: Software for building interpretable patient classifiers by multi-'omic data integration using patient similarity networks
title_sort netdx: software for building interpretable patient classifiers by multi-'omic data integration using patient similarity networks
topic Software Tool Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7883323/
https://www.ncbi.nlm.nih.gov/pubmed/33628435
http://dx.doi.org/10.12688/f1000research.26429.2
work_keys_str_mv AT paishraddha netdxsoftwareforbuildinginterpretablepatientclassifiersbymultiomicdataintegrationusingpatientsimilaritynetworks
AT weberphilipp netdxsoftwareforbuildinginterpretablepatientclassifiersbymultiomicdataintegrationusingpatientsimilaritynetworks
AT isserlinruth netdxsoftwareforbuildinginterpretablepatientclassifiersbymultiomicdataintegrationusingpatientsimilaritynetworks
AT kakahussam netdxsoftwareforbuildinginterpretablepatientclassifiersbymultiomicdataintegrationusingpatientsimilaritynetworks
AT huishirley netdxsoftwareforbuildinginterpretablepatientclassifiersbymultiomicdataintegrationusingpatientsimilaritynetworks
AT shahmuhammadahmad netdxsoftwareforbuildinginterpretablepatientclassifiersbymultiomicdataintegrationusingpatientsimilaritynetworks
AT giudiceluca netdxsoftwareforbuildinginterpretablepatientclassifiersbymultiomicdataintegrationusingpatientsimilaritynetworks
AT giugnorosalba netdxsoftwareforbuildinginterpretablepatientclassifiersbymultiomicdataintegrationusingpatientsimilaritynetworks
AT nøhrannekrogh netdxsoftwareforbuildinginterpretablepatientclassifiersbymultiomicdataintegrationusingpatientsimilaritynetworks
AT baumbachjan netdxsoftwareforbuildinginterpretablepatientclassifiersbymultiomicdataintegrationusingpatientsimilaritynetworks
AT badergaryd netdxsoftwareforbuildinginterpretablepatientclassifiersbymultiomicdataintegrationusingpatientsimilaritynetworks