Cargando…

Random forests with R

This book offers an application-oriented guide to random forests: a statistical learning method extensively used in many fields of application, thanks to its excellent predictive performance, but also to its flexibility, which places few restrictions on the nature of the data used. Indeed, random fo...

Descripción completa

Detalles Bibliográficos
Autores principales: Genuer, Robin, Poggi, Jean-Michel
Lenguaje:eng
Publicado: Springer 2020
Materias:
Acceso en línea:https://dx.doi.org/10.1007/978-3-030-56485-8
http://cds.cern.ch/record/2740521
_version_ 1780968332699107328
author Genuer, Robin
Poggi, Jean-Michel
author_facet Genuer, Robin
Poggi, Jean-Michel
author_sort Genuer, Robin
collection CERN
description This book offers an application-oriented guide to random forests: a statistical learning method extensively used in many fields of application, thanks to its excellent predictive performance, but also to its flexibility, which places few restrictions on the nature of the data used. Indeed, random forests can be adapted to both supervised classification problems and regression problems. In addition, they allow us to consider qualitative and quantitative explanatory variables together, without pre-processing. Moreover, they can be used to process standard data for which the number of observations is higher than the number of variables, while also performing very well in the high dimensional case, where the number of variables is quite large in comparison to the number of observations. Consequently, they are now among the preferred methods in the toolbox of statisticians and data scientists. The book is primarily intended for students in academic fields such as statistical education, but also for practitioners in statistics and machine learning. A scientific undergraduate degree is quite sufficient to take full advantage of the concepts, methods, and tools discussed. In terms of computer science skills, little background knowledge is required, though an introduction to the R language is recommended. Random forests are part of the family of tree-based methods; accordingly, after an introductory chapter, Chapter 2 presents CART trees. The next three chapters are devoted to random forests. They focus on their presentation (Chapter 3), on the variable importance tool (Chapter 4), and on the variable selection problem (Chapter 5), respectively. After discussing the concepts and methods, we illustrate their implementation on a running example. Then, various complements are provided before examining additional examples. Throughout the book, each result is given together with the code (in R) that can be used to reproduce it. Thus, the book offers readers essential information and concepts, together with examples and the software tools needed to analyse data using random forests. .
id cern-2740521
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2020
publisher Springer
record_format invenio
spelling cern-27405212021-04-21T16:45:48Zdoi:10.1007/978-3-030-56485-8http://cds.cern.ch/record/2740521engGenuer, RobinPoggi, Jean-MichelRandom forests with RMathematical Physics and MathematicsThis book offers an application-oriented guide to random forests: a statistical learning method extensively used in many fields of application, thanks to its excellent predictive performance, but also to its flexibility, which places few restrictions on the nature of the data used. Indeed, random forests can be adapted to both supervised classification problems and regression problems. In addition, they allow us to consider qualitative and quantitative explanatory variables together, without pre-processing. Moreover, they can be used to process standard data for which the number of observations is higher than the number of variables, while also performing very well in the high dimensional case, where the number of variables is quite large in comparison to the number of observations. Consequently, they are now among the preferred methods in the toolbox of statisticians and data scientists. The book is primarily intended for students in academic fields such as statistical education, but also for practitioners in statistics and machine learning. A scientific undergraduate degree is quite sufficient to take full advantage of the concepts, methods, and tools discussed. In terms of computer science skills, little background knowledge is required, though an introduction to the R language is recommended. Random forests are part of the family of tree-based methods; accordingly, after an introductory chapter, Chapter 2 presents CART trees. The next three chapters are devoted to random forests. They focus on their presentation (Chapter 3), on the variable importance tool (Chapter 4), and on the variable selection problem (Chapter 5), respectively. After discussing the concepts and methods, we illustrate their implementation on a running example. Then, various complements are provided before examining additional examples. Throughout the book, each result is given together with the code (in R) that can be used to reproduce it. Thus, the book offers readers essential information and concepts, together with examples and the software tools needed to analyse data using random forests. .Springeroai:cds.cern.ch:27405212020
spellingShingle Mathematical Physics and Mathematics
Genuer, Robin
Poggi, Jean-Michel
Random forests with R
title Random forests with R
title_full Random forests with R
title_fullStr Random forests with R
title_full_unstemmed Random forests with R
title_short Random forests with R
title_sort random forests with r
topic Mathematical Physics and Mathematics
url https://dx.doi.org/10.1007/978-3-030-56485-8
http://cds.cern.ch/record/2740521
work_keys_str_mv AT genuerrobin randomforestswithr
AT poggijeanmichel randomforestswithr