Cargando…

k-Nearest neighbor models for microarray gene expression analysis and clinical outcome prediction

In the clinical application of genomic data analysis and modeling, a number of factors contribute to the performance of disease classification and clinical outcome prediction. This study focuses on the k-nearest neighbor (KNN) modeling strategy and its clinical use. Although KNN is simple and clinic...

Descripción completa

Detalles Bibliográficos
Autores principales: Parry, R M, Jones, W, Stokes, T H, Phan, J H, Moffitt, R A, Fang, H, Shi, L, Oberthuer, A, Fischer, M, Tong, W, Wang, M D
Formato: Texto
Lenguaje:English
Publicado: Nature Publishing Group 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2920072/
https://www.ncbi.nlm.nih.gov/pubmed/20676068
http://dx.doi.org/10.1038/tpj.2010.56
_version_ 1782185239300276224
author Parry, R M
Jones, W
Stokes, T H
Phan, J H
Moffitt, R A
Fang, H
Shi, L
Oberthuer, A
Fischer, M
Tong, W
Wang, M D
author_facet Parry, R M
Jones, W
Stokes, T H
Phan, J H
Moffitt, R A
Fang, H
Shi, L
Oberthuer, A
Fischer, M
Tong, W
Wang, M D
author_sort Parry, R M
collection PubMed
description In the clinical application of genomic data analysis and modeling, a number of factors contribute to the performance of disease classification and clinical outcome prediction. This study focuses on the k-nearest neighbor (KNN) modeling strategy and its clinical use. Although KNN is simple and clinically appealing, large performance variations were found among experienced data analysis teams in the MicroArray Quality Control Phase II (MAQC-II) project. For clinical end points and controls from breast cancer, neuroblastoma and multiple myeloma, we systematically generated 463 320 KNN models by varying feature ranking method, number of features, distance metric, number of neighbors, vote weighting and decision threshold. We identified factors that contribute to the MAQC-II project performance variation, and validated a KNN data analysis protocol using a newly generated clinical data set with 478 neuroblastoma patients. We interpreted the biological and practical significance of the derived KNN models, and compared their performance with existing clinical factors.
format Text
id pubmed-2920072
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-29200722010-08-25 k-Nearest neighbor models for microarray gene expression analysis and clinical outcome prediction Parry, R M Jones, W Stokes, T H Phan, J H Moffitt, R A Fang, H Shi, L Oberthuer, A Fischer, M Tong, W Wang, M D Pharmacogenomics J Original Article In the clinical application of genomic data analysis and modeling, a number of factors contribute to the performance of disease classification and clinical outcome prediction. This study focuses on the k-nearest neighbor (KNN) modeling strategy and its clinical use. Although KNN is simple and clinically appealing, large performance variations were found among experienced data analysis teams in the MicroArray Quality Control Phase II (MAQC-II) project. For clinical end points and controls from breast cancer, neuroblastoma and multiple myeloma, we systematically generated 463 320 KNN models by varying feature ranking method, number of features, distance metric, number of neighbors, vote weighting and decision threshold. We identified factors that contribute to the MAQC-II project performance variation, and validated a KNN data analysis protocol using a newly generated clinical data set with 478 neuroblastoma patients. We interpreted the biological and practical significance of the derived KNN models, and compared their performance with existing clinical factors. Nature Publishing Group 2010-08 2010-07-30 /pmc/articles/PMC2920072/ /pubmed/20676068 http://dx.doi.org/10.1038/tpj.2010.56 Text en Copyright © 2010 Macmillan Publishers Limited http://creativecommons.org/licenses/by-nc-nd/3.0/ This work is licensed under the Creative Commons Attribution-NonCommercial-No Derivative Works 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-nd/3.0/
spellingShingle Original Article
Parry, R M
Jones, W
Stokes, T H
Phan, J H
Moffitt, R A
Fang, H
Shi, L
Oberthuer, A
Fischer, M
Tong, W
Wang, M D
k-Nearest neighbor models for microarray gene expression analysis and clinical outcome prediction
title k-Nearest neighbor models for microarray gene expression analysis and clinical outcome prediction
title_full k-Nearest neighbor models for microarray gene expression analysis and clinical outcome prediction
title_fullStr k-Nearest neighbor models for microarray gene expression analysis and clinical outcome prediction
title_full_unstemmed k-Nearest neighbor models for microarray gene expression analysis and clinical outcome prediction
title_short k-Nearest neighbor models for microarray gene expression analysis and clinical outcome prediction
title_sort k-nearest neighbor models for microarray gene expression analysis and clinical outcome prediction
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2920072/
https://www.ncbi.nlm.nih.gov/pubmed/20676068
http://dx.doi.org/10.1038/tpj.2010.56
work_keys_str_mv AT parryrm knearestneighbormodelsformicroarraygeneexpressionanalysisandclinicaloutcomeprediction
AT jonesw knearestneighbormodelsformicroarraygeneexpressionanalysisandclinicaloutcomeprediction
AT stokesth knearestneighbormodelsformicroarraygeneexpressionanalysisandclinicaloutcomeprediction
AT phanjh knearestneighbormodelsformicroarraygeneexpressionanalysisandclinicaloutcomeprediction
AT moffittra knearestneighbormodelsformicroarraygeneexpressionanalysisandclinicaloutcomeprediction
AT fangh knearestneighbormodelsformicroarraygeneexpressionanalysisandclinicaloutcomeprediction
AT shil knearestneighbormodelsformicroarraygeneexpressionanalysisandclinicaloutcomeprediction
AT oberthuera knearestneighbormodelsformicroarraygeneexpressionanalysisandclinicaloutcomeprediction
AT fischerm knearestneighbormodelsformicroarraygeneexpressionanalysisandclinicaloutcomeprediction
AT tongw knearestneighbormodelsformicroarraygeneexpressionanalysisandclinicaloutcomeprediction
AT wangmd knearestneighbormodelsformicroarraygeneexpressionanalysisandclinicaloutcomeprediction