Cargando…

Deep exploration of random forest model boosts the interpretability of machine learning studies of complicated immune responses and lung burden of nanoparticles

The development of machine learning provides solutions for predicting the complicated immune responses and pharmacokinetics of nanoparticles (NPs) in vivo. However, highly heterogeneous data in NP studies remain challenging because of the low interpretability of machine learning. Here, we propose a...

Descripción completa

Detalles Bibliográficos
Autores principales: Yu, Fubo, Wei, Changhong, Deng, Peng, Peng, Ting, Hu, Xiangang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Association for the Advancement of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8153727/
https://www.ncbi.nlm.nih.gov/pubmed/34039604
http://dx.doi.org/10.1126/sciadv.abf4130
_version_ 1783698863185461248
author Yu, Fubo
Wei, Changhong
Deng, Peng
Peng, Ting
Hu, Xiangang
author_facet Yu, Fubo
Wei, Changhong
Deng, Peng
Peng, Ting
Hu, Xiangang
author_sort Yu, Fubo
collection PubMed
description The development of machine learning provides solutions for predicting the complicated immune responses and pharmacokinetics of nanoparticles (NPs) in vivo. However, highly heterogeneous data in NP studies remain challenging because of the low interpretability of machine learning. Here, we propose a tree-based random forest feature importance and feature interaction network analysis framework (TBRFA) and accurately predict the pulmonary immune responses and lung burden of NPs, with the correlation coefficient of all training sets >0.9 and half of the test sets >0.75. This framework overcomes the feature importance bias brought by small datasets through a multiway importance analysis. TBRFA also builds feature interaction networks, boosts model interpretability, and reveals hidden interactional factors (e.g., various NP properties and exposure conditions). TBRFA provides guidance for the design and application of ideal NPs and discovers the feature interaction networks that contribute to complex systems with small-size data in various fields.
format Online
Article
Text
id pubmed-8153727
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher American Association for the Advancement of Science
record_format MEDLINE/PubMed
spelling pubmed-81537272021-06-07 Deep exploration of random forest model boosts the interpretability of machine learning studies of complicated immune responses and lung burden of nanoparticles Yu, Fubo Wei, Changhong Deng, Peng Peng, Ting Hu, Xiangang Sci Adv Research Articles The development of machine learning provides solutions for predicting the complicated immune responses and pharmacokinetics of nanoparticles (NPs) in vivo. However, highly heterogeneous data in NP studies remain challenging because of the low interpretability of machine learning. Here, we propose a tree-based random forest feature importance and feature interaction network analysis framework (TBRFA) and accurately predict the pulmonary immune responses and lung burden of NPs, with the correlation coefficient of all training sets >0.9 and half of the test sets >0.75. This framework overcomes the feature importance bias brought by small datasets through a multiway importance analysis. TBRFA also builds feature interaction networks, boosts model interpretability, and reveals hidden interactional factors (e.g., various NP properties and exposure conditions). TBRFA provides guidance for the design and application of ideal NPs and discovers the feature interaction networks that contribute to complex systems with small-size data in various fields. American Association for the Advancement of Science 2021-05-26 /pmc/articles/PMC8153727/ /pubmed/34039604 http://dx.doi.org/10.1126/sciadv.abf4130 Text en Copyright © 2021 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works. Distributed under a Creative Commons Attribution NonCommercial License 4.0 (CC BY-NC). https://creativecommons.org/licenses/by-nc/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial license (https://creativecommons.org/licenses/by-nc/4.0/) , which permits use, distribution, and reproduction in any medium, so long as the resultant use is not for commercial advantage and provided the original work is properly cited.
spellingShingle Research Articles
Yu, Fubo
Wei, Changhong
Deng, Peng
Peng, Ting
Hu, Xiangang
Deep exploration of random forest model boosts the interpretability of machine learning studies of complicated immune responses and lung burden of nanoparticles
title Deep exploration of random forest model boosts the interpretability of machine learning studies of complicated immune responses and lung burden of nanoparticles
title_full Deep exploration of random forest model boosts the interpretability of machine learning studies of complicated immune responses and lung burden of nanoparticles
title_fullStr Deep exploration of random forest model boosts the interpretability of machine learning studies of complicated immune responses and lung burden of nanoparticles
title_full_unstemmed Deep exploration of random forest model boosts the interpretability of machine learning studies of complicated immune responses and lung burden of nanoparticles
title_short Deep exploration of random forest model boosts the interpretability of machine learning studies of complicated immune responses and lung burden of nanoparticles
title_sort deep exploration of random forest model boosts the interpretability of machine learning studies of complicated immune responses and lung burden of nanoparticles
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8153727/
https://www.ncbi.nlm.nih.gov/pubmed/34039604
http://dx.doi.org/10.1126/sciadv.abf4130
work_keys_str_mv AT yufubo deepexplorationofrandomforestmodelbooststheinterpretabilityofmachinelearningstudiesofcomplicatedimmuneresponsesandlungburdenofnanoparticles
AT weichanghong deepexplorationofrandomforestmodelbooststheinterpretabilityofmachinelearningstudiesofcomplicatedimmuneresponsesandlungburdenofnanoparticles
AT dengpeng deepexplorationofrandomforestmodelbooststheinterpretabilityofmachinelearningstudiesofcomplicatedimmuneresponsesandlungburdenofnanoparticles
AT pengting deepexplorationofrandomforestmodelbooststheinterpretabilityofmachinelearningstudiesofcomplicatedimmuneresponsesandlungburdenofnanoparticles
AT huxiangang deepexplorationofrandomforestmodelbooststheinterpretabilityofmachinelearningstudiesofcomplicatedimmuneresponsesandlungburdenofnanoparticles