Cargando…

Challenges in the Analysis of Mass-Throughput Data: A Technical Commentary from the Statistical Machine Learning Perspective

Sound data analysis is critical to the success of modern molecular medicine research that involves collection and interpretation of mass-throughput data. The novel nature and high-dimensionality in such datasets pose a series of nontrivial data analysis problems. This technical commentary discusses...

Descripción completa

Detalles Bibliográficos
Autores principales: Aliferis, Constantin F., Statnikov, Alexander, Tsamardinos, Ioannis
Formato: Texto
Lenguaje:English
Publicado: Libertas Academica 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2675497/
https://www.ncbi.nlm.nih.gov/pubmed/19458765
_version_ 1782166701569212416
author Aliferis, Constantin F.
Statnikov, Alexander
Tsamardinos, Ioannis
author_facet Aliferis, Constantin F.
Statnikov, Alexander
Tsamardinos, Ioannis
author_sort Aliferis, Constantin F.
collection PubMed
description Sound data analysis is critical to the success of modern molecular medicine research that involves collection and interpretation of mass-throughput data. The novel nature and high-dimensionality in such datasets pose a series of nontrivial data analysis problems. This technical commentary discusses the problems of over-fitting, error estimation, curse of dimensionality, causal versus predictive modeling, integration of heterogeneous types of data, and lack of standard protocols for data analysis. We attempt to shed light on the nature and causes of these problems and to outline viable methodological approaches to overcome them.
format Text
id pubmed-2675497
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-26754972009-05-20 Challenges in the Analysis of Mass-Throughput Data: A Technical Commentary from the Statistical Machine Learning Perspective Aliferis, Constantin F. Statnikov, Alexander Tsamardinos, Ioannis Cancer Inform Technical Note Sound data analysis is critical to the success of modern molecular medicine research that involves collection and interpretation of mass-throughput data. The novel nature and high-dimensionality in such datasets pose a series of nontrivial data analysis problems. This technical commentary discusses the problems of over-fitting, error estimation, curse of dimensionality, causal versus predictive modeling, integration of heterogeneous types of data, and lack of standard protocols for data analysis. We attempt to shed light on the nature and causes of these problems and to outline viable methodological approaches to overcome them. Libertas Academica 2007-02-16 /pmc/articles/PMC2675497/ /pubmed/19458765 Text en © 2006 The authors. This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).
spellingShingle Technical Note
Aliferis, Constantin F.
Statnikov, Alexander
Tsamardinos, Ioannis
Challenges in the Analysis of Mass-Throughput Data: A Technical Commentary from the Statistical Machine Learning Perspective
title Challenges in the Analysis of Mass-Throughput Data: A Technical Commentary from the Statistical Machine Learning Perspective
title_full Challenges in the Analysis of Mass-Throughput Data: A Technical Commentary from the Statistical Machine Learning Perspective
title_fullStr Challenges in the Analysis of Mass-Throughput Data: A Technical Commentary from the Statistical Machine Learning Perspective
title_full_unstemmed Challenges in the Analysis of Mass-Throughput Data: A Technical Commentary from the Statistical Machine Learning Perspective
title_short Challenges in the Analysis of Mass-Throughput Data: A Technical Commentary from the Statistical Machine Learning Perspective
title_sort challenges in the analysis of mass-throughput data: a technical commentary from the statistical machine learning perspective
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2675497/
https://www.ncbi.nlm.nih.gov/pubmed/19458765
work_keys_str_mv AT aliferisconstantinf challengesintheanalysisofmassthroughputdataatechnicalcommentaryfromthestatisticalmachinelearningperspective
AT statnikovalexander challengesintheanalysisofmassthroughputdataatechnicalcommentaryfromthestatisticalmachinelearningperspective
AT tsamardinosioannis challengesintheanalysisofmassthroughputdataatechnicalcommentaryfromthestatisticalmachinelearningperspective