Cargando…

New theory of discriminant analysis after R. Fisher: advanced research by the feature selection method for microarray data

This is the first book to compare eight LDFs by different types of datasets, such as Fisher’s iris data, medical data with collinearities, Swiss banknote data that is a linearly separable data (LSD), student pass/fail determination using student attributes, 18 pass/fail determinations using exam sco...

Descripción completa

Detalles Bibliográficos
Autor principal: Shinmura, Shuichi
Lenguaje:eng
Publicado: Springer 2016
Materias:
Acceso en línea:https://dx.doi.org/10.1007/978-981-10-2164-0
http://cds.cern.ch/record/2240974
_version_ 1780953146660487168
author Shinmura, Shuichi
author_facet Shinmura, Shuichi
author_sort Shinmura, Shuichi
collection CERN
description This is the first book to compare eight LDFs by different types of datasets, such as Fisher’s iris data, medical data with collinearities, Swiss banknote data that is a linearly separable data (LSD), student pass/fail determination using student attributes, 18 pass/fail determinations using exam scores, Japanese automobile data, and six microarray datasets (the datasets) that are LSD. We developed the 100-fold cross-validation for the small sample method (Method 1) instead of the LOO method. We proposed a simple model selection procedure to choose the best model having minimum M2 and Revised IP-OLDF based on MNM criterion was found to be better than other M2s in the above datasets. We compared two statistical LDFs and six MP-based LDFs. Those were Fisher’s LDF, logistic regression, three SVMs, Revised IP-OLDF, and another two OLDFs. Only a hard-margin SVM (H-SVM) and Revised IP-OLDF could discriminate LSD theoretically (Problem 2). We solved the defect of the generalized inverse matrices (Problem 3). For more than 10 years, many researchers have struggled to analyze the microarray dataset that is LSD (Problem 5). If we call the linearly separable model "Matroska," the dataset consists of numerous smaller Matroskas in it. We develop the Matroska feature selection method (Method 2). It finds the surprising structure of the dataset that is the disjoint union of several small Matroskas. Our theory and methods reveal new facts of gene analysis.
id cern-2240974
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2016
publisher Springer
record_format invenio
spelling cern-22409742021-04-21T19:23:13Zdoi:10.1007/978-981-10-2164-0http://cds.cern.ch/record/2240974engShinmura, ShuichiNew theory of discriminant analysis after R. Fisher: advanced research by the feature selection method for microarray dataMathematical Physics and MathematicsThis is the first book to compare eight LDFs by different types of datasets, such as Fisher’s iris data, medical data with collinearities, Swiss banknote data that is a linearly separable data (LSD), student pass/fail determination using student attributes, 18 pass/fail determinations using exam scores, Japanese automobile data, and six microarray datasets (the datasets) that are LSD. We developed the 100-fold cross-validation for the small sample method (Method 1) instead of the LOO method. We proposed a simple model selection procedure to choose the best model having minimum M2 and Revised IP-OLDF based on MNM criterion was found to be better than other M2s in the above datasets. We compared two statistical LDFs and six MP-based LDFs. Those were Fisher’s LDF, logistic regression, three SVMs, Revised IP-OLDF, and another two OLDFs. Only a hard-margin SVM (H-SVM) and Revised IP-OLDF could discriminate LSD theoretically (Problem 2). We solved the defect of the generalized inverse matrices (Problem 3). For more than 10 years, many researchers have struggled to analyze the microarray dataset that is LSD (Problem 5). If we call the linearly separable model "Matroska," the dataset consists of numerous smaller Matroskas in it. We develop the Matroska feature selection method (Method 2). It finds the surprising structure of the dataset that is the disjoint union of several small Matroskas. Our theory and methods reveal new facts of gene analysis.Springeroai:cds.cern.ch:22409742016
spellingShingle Mathematical Physics and Mathematics
Shinmura, Shuichi
New theory of discriminant analysis after R. Fisher: advanced research by the feature selection method for microarray data
title New theory of discriminant analysis after R. Fisher: advanced research by the feature selection method for microarray data
title_full New theory of discriminant analysis after R. Fisher: advanced research by the feature selection method for microarray data
title_fullStr New theory of discriminant analysis after R. Fisher: advanced research by the feature selection method for microarray data
title_full_unstemmed New theory of discriminant analysis after R. Fisher: advanced research by the feature selection method for microarray data
title_short New theory of discriminant analysis after R. Fisher: advanced research by the feature selection method for microarray data
title_sort new theory of discriminant analysis after r. fisher: advanced research by the feature selection method for microarray data
topic Mathematical Physics and Mathematics
url https://dx.doi.org/10.1007/978-981-10-2164-0
http://cds.cern.ch/record/2240974
work_keys_str_mv AT shinmurashuichi newtheoryofdiscriminantanalysisafterrfisheradvancedresearchbythefeatureselectionmethodformicroarraydata