Cargando…
Contrast trees and distribution boosting
A method for decision tree induction is presented. Given a set of predictor variables [Formula: see text] and two outcome variables [Formula: see text] and [Formula: see text] associated with each [Formula: see text] , the goal is to identify those values of [Formula: see text] for which the respect...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
National Academy of Sciences
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7474603/ https://www.ncbi.nlm.nih.gov/pubmed/32817416 http://dx.doi.org/10.1073/pnas.1921562117 |
_version_ | 1783579361840988160 |
---|---|
author | Friedman, Jerome H. |
author_facet | Friedman, Jerome H. |
author_sort | Friedman, Jerome H. |
collection | PubMed |
description | A method for decision tree induction is presented. Given a set of predictor variables [Formula: see text] and two outcome variables [Formula: see text] and [Formula: see text] associated with each [Formula: see text] , the goal is to identify those values of [Formula: see text] for which the respective distributions of [Formula: see text] and [Formula: see text] , or selected properties of those distributions such as means or quantiles, are most different. Contrast trees provide a lack-of-fit measure for statistical models of such statistics, or for the complete conditional distribution [Formula: see text] , as a function of [Formula: see text]. They are easily interpreted and can be used as diagnostic tools to reveal and then understand the inaccuracies of models produced by any learning method. A corresponding contrast-boosting strategy is described for remedying any uncovered errors, thereby producing potentially more accurate predictions. This leads to a distribution-boosting strategy for directly estimating the full conditional distribution of [Formula: see text] at each [Formula: see text] under no assumptions concerning its shape, form, or parametric representation. |
format | Online Article Text |
id | pubmed-7474603 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | National Academy of Sciences |
record_format | MEDLINE/PubMed |
spelling | pubmed-74746032020-09-18 Contrast trees and distribution boosting Friedman, Jerome H. Proc Natl Acad Sci U S A Physical Sciences A method for decision tree induction is presented. Given a set of predictor variables [Formula: see text] and two outcome variables [Formula: see text] and [Formula: see text] associated with each [Formula: see text] , the goal is to identify those values of [Formula: see text] for which the respective distributions of [Formula: see text] and [Formula: see text] , or selected properties of those distributions such as means or quantiles, are most different. Contrast trees provide a lack-of-fit measure for statistical models of such statistics, or for the complete conditional distribution [Formula: see text] , as a function of [Formula: see text]. They are easily interpreted and can be used as diagnostic tools to reveal and then understand the inaccuracies of models produced by any learning method. A corresponding contrast-boosting strategy is described for remedying any uncovered errors, thereby producing potentially more accurate predictions. This leads to a distribution-boosting strategy for directly estimating the full conditional distribution of [Formula: see text] at each [Formula: see text] under no assumptions concerning its shape, form, or parametric representation. National Academy of Sciences 2020-09-01 2020-08-19 /pmc/articles/PMC7474603/ /pubmed/32817416 http://dx.doi.org/10.1073/pnas.1921562117 Text en Copyright © 2020 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by-nc-nd/4.0/ https://creativecommons.org/licenses/by-nc-nd/4.0/This open access article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND) (https://creativecommons.org/licenses/by-nc-nd/4.0/) . |
spellingShingle | Physical Sciences Friedman, Jerome H. Contrast trees and distribution boosting |
title | Contrast trees and distribution boosting |
title_full | Contrast trees and distribution boosting |
title_fullStr | Contrast trees and distribution boosting |
title_full_unstemmed | Contrast trees and distribution boosting |
title_short | Contrast trees and distribution boosting |
title_sort | contrast trees and distribution boosting |
topic | Physical Sciences |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7474603/ https://www.ncbi.nlm.nih.gov/pubmed/32817416 http://dx.doi.org/10.1073/pnas.1921562117 |
work_keys_str_mv | AT friedmanjeromeh contrasttreesanddistributionboosting |