Cargando…
Deep exploration of random forest model boosts the interpretability of machine learning studies of complicated immune responses and lung burden of nanoparticles
The development of machine learning provides solutions for predicting the complicated immune responses and pharmacokinetics of nanoparticles (NPs) in vivo. However, highly heterogeneous data in NP studies remain challenging because of the low interpretability of machine learning. Here, we propose a...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Association for the Advancement of Science
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8153727/ https://www.ncbi.nlm.nih.gov/pubmed/34039604 http://dx.doi.org/10.1126/sciadv.abf4130 |
_version_ | 1783698863185461248 |
---|---|
author | Yu, Fubo Wei, Changhong Deng, Peng Peng, Ting Hu, Xiangang |
author_facet | Yu, Fubo Wei, Changhong Deng, Peng Peng, Ting Hu, Xiangang |
author_sort | Yu, Fubo |
collection | PubMed |
description | The development of machine learning provides solutions for predicting the complicated immune responses and pharmacokinetics of nanoparticles (NPs) in vivo. However, highly heterogeneous data in NP studies remain challenging because of the low interpretability of machine learning. Here, we propose a tree-based random forest feature importance and feature interaction network analysis framework (TBRFA) and accurately predict the pulmonary immune responses and lung burden of NPs, with the correlation coefficient of all training sets >0.9 and half of the test sets >0.75. This framework overcomes the feature importance bias brought by small datasets through a multiway importance analysis. TBRFA also builds feature interaction networks, boosts model interpretability, and reveals hidden interactional factors (e.g., various NP properties and exposure conditions). TBRFA provides guidance for the design and application of ideal NPs and discovers the feature interaction networks that contribute to complex systems with small-size data in various fields. |
format | Online Article Text |
id | pubmed-8153727 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | American Association for the Advancement of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-81537272021-06-07 Deep exploration of random forest model boosts the interpretability of machine learning studies of complicated immune responses and lung burden of nanoparticles Yu, Fubo Wei, Changhong Deng, Peng Peng, Ting Hu, Xiangang Sci Adv Research Articles The development of machine learning provides solutions for predicting the complicated immune responses and pharmacokinetics of nanoparticles (NPs) in vivo. However, highly heterogeneous data in NP studies remain challenging because of the low interpretability of machine learning. Here, we propose a tree-based random forest feature importance and feature interaction network analysis framework (TBRFA) and accurately predict the pulmonary immune responses and lung burden of NPs, with the correlation coefficient of all training sets >0.9 and half of the test sets >0.75. This framework overcomes the feature importance bias brought by small datasets through a multiway importance analysis. TBRFA also builds feature interaction networks, boosts model interpretability, and reveals hidden interactional factors (e.g., various NP properties and exposure conditions). TBRFA provides guidance for the design and application of ideal NPs and discovers the feature interaction networks that contribute to complex systems with small-size data in various fields. American Association for the Advancement of Science 2021-05-26 /pmc/articles/PMC8153727/ /pubmed/34039604 http://dx.doi.org/10.1126/sciadv.abf4130 Text en Copyright © 2021 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works. Distributed under a Creative Commons Attribution NonCommercial License 4.0 (CC BY-NC). https://creativecommons.org/licenses/by-nc/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial license (https://creativecommons.org/licenses/by-nc/4.0/) , which permits use, distribution, and reproduction in any medium, so long as the resultant use is not for commercial advantage and provided the original work is properly cited. |
spellingShingle | Research Articles Yu, Fubo Wei, Changhong Deng, Peng Peng, Ting Hu, Xiangang Deep exploration of random forest model boosts the interpretability of machine learning studies of complicated immune responses and lung burden of nanoparticles |
title | Deep exploration of random forest model boosts the interpretability of machine learning studies of complicated immune responses and lung burden of nanoparticles |
title_full | Deep exploration of random forest model boosts the interpretability of machine learning studies of complicated immune responses and lung burden of nanoparticles |
title_fullStr | Deep exploration of random forest model boosts the interpretability of machine learning studies of complicated immune responses and lung burden of nanoparticles |
title_full_unstemmed | Deep exploration of random forest model boosts the interpretability of machine learning studies of complicated immune responses and lung burden of nanoparticles |
title_short | Deep exploration of random forest model boosts the interpretability of machine learning studies of complicated immune responses and lung burden of nanoparticles |
title_sort | deep exploration of random forest model boosts the interpretability of machine learning studies of complicated immune responses and lung burden of nanoparticles |
topic | Research Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8153727/ https://www.ncbi.nlm.nih.gov/pubmed/34039604 http://dx.doi.org/10.1126/sciadv.abf4130 |
work_keys_str_mv | AT yufubo deepexplorationofrandomforestmodelbooststheinterpretabilityofmachinelearningstudiesofcomplicatedimmuneresponsesandlungburdenofnanoparticles AT weichanghong deepexplorationofrandomforestmodelbooststheinterpretabilityofmachinelearningstudiesofcomplicatedimmuneresponsesandlungburdenofnanoparticles AT dengpeng deepexplorationofrandomforestmodelbooststheinterpretabilityofmachinelearningstudiesofcomplicatedimmuneresponsesandlungburdenofnanoparticles AT pengting deepexplorationofrandomforestmodelbooststheinterpretabilityofmachinelearningstudiesofcomplicatedimmuneresponsesandlungburdenofnanoparticles AT huxiangang deepexplorationofrandomforestmodelbooststheinterpretabilityofmachinelearningstudiesofcomplicatedimmuneresponsesandlungburdenofnanoparticles |