Cargando…

From complex data to biological insight: ‘DEKER’ feature selection and network inference

Network inference is a valuable approach for gaining mechanistic insight from high-dimensional biological data. Existing methods for network inference focus on ranking all possible relations (edges) among all measured quantities such as genes, proteins, metabolites (features) observed, which yields...

Descripción completa

Detalles Bibliográficos
Autores principales: Hayes, Sean M. S., Sachs, Jeffrey R., Cho, Carolyn R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8837529/
https://www.ncbi.nlm.nih.gov/pubmed/34791577
http://dx.doi.org/10.1007/s10928-021-09792-7
_version_ 1784649929658990592
author Hayes, Sean M. S.
Sachs, Jeffrey R.
Cho, Carolyn R.
author_facet Hayes, Sean M. S.
Sachs, Jeffrey R.
Cho, Carolyn R.
author_sort Hayes, Sean M. S.
collection PubMed
description Network inference is a valuable approach for gaining mechanistic insight from high-dimensional biological data. Existing methods for network inference focus on ranking all possible relations (edges) among all measured quantities such as genes, proteins, metabolites (features) observed, which yields a dense network that is challenging to interpret. Identifying a sparse, interpretable network using these methods thus requires an error-prone thresholding step which compromises their performance. In this article we propose a new method, DEKER-NET, that addresses this limitation by directly identifying a sparse, interpretable network without thresholding, improving real-world performance. DEKER-NET uses a novel machine learning method for feature selection in an iterative framework for network inference. DEKER-NET is extremely flexible, handling linear and nonlinear relations while making no assumptions about the underlying distribution of data, and is suitable for categorical or continuous variables. We test our method on the Dialogue for Reverse Engineering Assessments and Methods (DREAM) challenge data, demonstrating that it can directly identify sparse, interpretable networks without thresholding while maintaining performance comparable to the hypothetical best-case thresholded network of other methods. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s10928-021-09792-7.
format Online
Article
Text
id pubmed-8837529
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-88375292022-02-23 From complex data to biological insight: ‘DEKER’ feature selection and network inference Hayes, Sean M. S. Sachs, Jeffrey R. Cho, Carolyn R. J Pharmacokinet Pharmacodyn Original Paper Network inference is a valuable approach for gaining mechanistic insight from high-dimensional biological data. Existing methods for network inference focus on ranking all possible relations (edges) among all measured quantities such as genes, proteins, metabolites (features) observed, which yields a dense network that is challenging to interpret. Identifying a sparse, interpretable network using these methods thus requires an error-prone thresholding step which compromises their performance. In this article we propose a new method, DEKER-NET, that addresses this limitation by directly identifying a sparse, interpretable network without thresholding, improving real-world performance. DEKER-NET uses a novel machine learning method for feature selection in an iterative framework for network inference. DEKER-NET is extremely flexible, handling linear and nonlinear relations while making no assumptions about the underlying distribution of data, and is suitable for categorical or continuous variables. We test our method on the Dialogue for Reverse Engineering Assessments and Methods (DREAM) challenge data, demonstrating that it can directly identify sparse, interpretable networks without thresholding while maintaining performance comparable to the hypothetical best-case thresholded network of other methods. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s10928-021-09792-7. Springer US 2021-11-17 2022 /pmc/articles/PMC8837529/ /pubmed/34791577 http://dx.doi.org/10.1007/s10928-021-09792-7 Text en © Merck Sharp & Dohme Corp., a subsidiary of Merck & Co., Inc., Kenilworth, N.J., U.S.A., under exclusive licence to Springer Science+Business Media 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Original Paper
Hayes, Sean M. S.
Sachs, Jeffrey R.
Cho, Carolyn R.
From complex data to biological insight: ‘DEKER’ feature selection and network inference
title From complex data to biological insight: ‘DEKER’ feature selection and network inference
title_full From complex data to biological insight: ‘DEKER’ feature selection and network inference
title_fullStr From complex data to biological insight: ‘DEKER’ feature selection and network inference
title_full_unstemmed From complex data to biological insight: ‘DEKER’ feature selection and network inference
title_short From complex data to biological insight: ‘DEKER’ feature selection and network inference
title_sort from complex data to biological insight: ‘deker’ feature selection and network inference
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8837529/
https://www.ncbi.nlm.nih.gov/pubmed/34791577
http://dx.doi.org/10.1007/s10928-021-09792-7
work_keys_str_mv AT hayesseanms fromcomplexdatatobiologicalinsightdekerfeatureselectionandnetworkinference
AT sachsjeffreyr fromcomplexdatatobiologicalinsightdekerfeatureselectionandnetworkinference
AT chocarolynr fromcomplexdatatobiologicalinsightdekerfeatureselectionandnetworkinference