Cargando…
High-Dimensional Statistical Learning: Roots, Justifications, and Potential Machineries
High-dimensional data generally refer to data in which the number of variables is larger than the sample size. Analyzing such datasets poses great challenges for classical statistical learning because the finite-sample performance of methods developed within classical statistical learning does not l...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Libertas Academica
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4830639/ https://www.ncbi.nlm.nih.gov/pubmed/27081307 http://dx.doi.org/10.4137/CIN.S30804 |
_version_ | 1782426930843222016 |
---|---|
author | Zollanvari, Amin |
author_facet | Zollanvari, Amin |
author_sort | Zollanvari, Amin |
collection | PubMed |
description | High-dimensional data generally refer to data in which the number of variables is larger than the sample size. Analyzing such datasets poses great challenges for classical statistical learning because the finite-sample performance of methods developed within classical statistical learning does not live up to classical asymptotic premises in which the sample size unboundedly grows for a fixed dimensionality of observations. Much work has been done in developing mathematical–statistical techniques for analyzing high-dimensional data. Despite remarkable progress in this field, many practitioners still utilize classical methods for analyzing such datasets. This state of affairs can be attributed, in part, to a lack of knowledge and, in part, to the ready-to-use computational and statistical software packages that are well developed for classical techniques. Moreover, many scientists working in a specific field of high-dimensional statistical learning are either not aware of other existing machineries in the field or are not willing to try them out. The primary goal in this work is to bring together various machineries of high-dimensional analysis, give an overview of the important results, and present the operating conditions upon which they are grounded. When appropriate, readers are referred to relevant review articles for more information on a specific subject. |
format | Online Article Text |
id | pubmed-4830639 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Libertas Academica |
record_format | MEDLINE/PubMed |
spelling | pubmed-48306392016-04-14 High-Dimensional Statistical Learning: Roots, Justifications, and Potential Machineries Zollanvari, Amin Cancer Inform Review High-dimensional data generally refer to data in which the number of variables is larger than the sample size. Analyzing such datasets poses great challenges for classical statistical learning because the finite-sample performance of methods developed within classical statistical learning does not live up to classical asymptotic premises in which the sample size unboundedly grows for a fixed dimensionality of observations. Much work has been done in developing mathematical–statistical techniques for analyzing high-dimensional data. Despite remarkable progress in this field, many practitioners still utilize classical methods for analyzing such datasets. This state of affairs can be attributed, in part, to a lack of knowledge and, in part, to the ready-to-use computational and statistical software packages that are well developed for classical techniques. Moreover, many scientists working in a specific field of high-dimensional statistical learning are either not aware of other existing machineries in the field or are not willing to try them out. The primary goal in this work is to bring together various machineries of high-dimensional analysis, give an overview of the important results, and present the operating conditions upon which they are grounded. When appropriate, readers are referred to relevant review articles for more information on a specific subject. Libertas Academica 2016-04-12 /pmc/articles/PMC4830639/ /pubmed/27081307 http://dx.doi.org/10.4137/CIN.S30804 Text en © 2015 the author(s), publisher and licensee Libertas Academica Ltd. This is an open-access article distributed under the terms of the Creative Commons CC-BY-NC 3.0 license. |
spellingShingle | Review Zollanvari, Amin High-Dimensional Statistical Learning: Roots, Justifications, and Potential Machineries |
title | High-Dimensional Statistical Learning: Roots, Justifications, and Potential Machineries |
title_full | High-Dimensional Statistical Learning: Roots, Justifications, and Potential Machineries |
title_fullStr | High-Dimensional Statistical Learning: Roots, Justifications, and Potential Machineries |
title_full_unstemmed | High-Dimensional Statistical Learning: Roots, Justifications, and Potential Machineries |
title_short | High-Dimensional Statistical Learning: Roots, Justifications, and Potential Machineries |
title_sort | high-dimensional statistical learning: roots, justifications, and potential machineries |
topic | Review |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4830639/ https://www.ncbi.nlm.nih.gov/pubmed/27081307 http://dx.doi.org/10.4137/CIN.S30804 |
work_keys_str_mv | AT zollanvariamin highdimensionalstatisticallearningrootsjustificationsandpotentialmachineries |