Cargando…
Challenges in the Analysis of Mass-Throughput Data: A Technical Commentary from the Statistical Machine Learning Perspective
Sound data analysis is critical to the success of modern molecular medicine research that involves collection and interpretation of mass-throughput data. The novel nature and high-dimensionality in such datasets pose a series of nontrivial data analysis problems. This technical commentary discusses...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Libertas Academica
2007
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2675497/ https://www.ncbi.nlm.nih.gov/pubmed/19458765 |
_version_ | 1782166701569212416 |
---|---|
author | Aliferis, Constantin F. Statnikov, Alexander Tsamardinos, Ioannis |
author_facet | Aliferis, Constantin F. Statnikov, Alexander Tsamardinos, Ioannis |
author_sort | Aliferis, Constantin F. |
collection | PubMed |
description | Sound data analysis is critical to the success of modern molecular medicine research that involves collection and interpretation of mass-throughput data. The novel nature and high-dimensionality in such datasets pose a series of nontrivial data analysis problems. This technical commentary discusses the problems of over-fitting, error estimation, curse of dimensionality, causal versus predictive modeling, integration of heterogeneous types of data, and lack of standard protocols for data analysis. We attempt to shed light on the nature and causes of these problems and to outline viable methodological approaches to overcome them. |
format | Text |
id | pubmed-2675497 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | Libertas Academica |
record_format | MEDLINE/PubMed |
spelling | pubmed-26754972009-05-20 Challenges in the Analysis of Mass-Throughput Data: A Technical Commentary from the Statistical Machine Learning Perspective Aliferis, Constantin F. Statnikov, Alexander Tsamardinos, Ioannis Cancer Inform Technical Note Sound data analysis is critical to the success of modern molecular medicine research that involves collection and interpretation of mass-throughput data. The novel nature and high-dimensionality in such datasets pose a series of nontrivial data analysis problems. This technical commentary discusses the problems of over-fitting, error estimation, curse of dimensionality, causal versus predictive modeling, integration of heterogeneous types of data, and lack of standard protocols for data analysis. We attempt to shed light on the nature and causes of these problems and to outline viable methodological approaches to overcome them. Libertas Academica 2007-02-16 /pmc/articles/PMC2675497/ /pubmed/19458765 Text en © 2006 The authors. This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/). |
spellingShingle | Technical Note Aliferis, Constantin F. Statnikov, Alexander Tsamardinos, Ioannis Challenges in the Analysis of Mass-Throughput Data: A Technical Commentary from the Statistical Machine Learning Perspective |
title | Challenges in the Analysis of Mass-Throughput Data: A Technical Commentary from the Statistical Machine Learning Perspective |
title_full | Challenges in the Analysis of Mass-Throughput Data: A Technical Commentary from the Statistical Machine Learning Perspective |
title_fullStr | Challenges in the Analysis of Mass-Throughput Data: A Technical Commentary from the Statistical Machine Learning Perspective |
title_full_unstemmed | Challenges in the Analysis of Mass-Throughput Data: A Technical Commentary from the Statistical Machine Learning Perspective |
title_short | Challenges in the Analysis of Mass-Throughput Data: A Technical Commentary from the Statistical Machine Learning Perspective |
title_sort | challenges in the analysis of mass-throughput data: a technical commentary from the statistical machine learning perspective |
topic | Technical Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2675497/ https://www.ncbi.nlm.nih.gov/pubmed/19458765 |
work_keys_str_mv | AT aliferisconstantinf challengesintheanalysisofmassthroughputdataatechnicalcommentaryfromthestatisticalmachinelearningperspective AT statnikovalexander challengesintheanalysisofmassthroughputdataatechnicalcommentaryfromthestatisticalmachinelearningperspective AT tsamardinosioannis challengesintheanalysisofmassthroughputdataatechnicalcommentaryfromthestatisticalmachinelearningperspective |