Cargando…

The application of sparse estimation of covariance matrix to quadratic discriminant analysis

BACKGROUND: Although Linear Discriminant Analysis (LDA) is commonly used for classification, it may not be directly applied in genomics studies due to the large p, small n problem in these studies. Different versions of sparse LDA have been proposed to address this significant challenge. One implici...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Jiehuan, Zhao, Hongyu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4355996/
https://www.ncbi.nlm.nih.gov/pubmed/25886892
http://dx.doi.org/10.1186/s12859-014-0443-6
_version_ 1782360939951030272
author Sun, Jiehuan
Zhao, Hongyu
author_facet Sun, Jiehuan
Zhao, Hongyu
author_sort Sun, Jiehuan
collection PubMed
description BACKGROUND: Although Linear Discriminant Analysis (LDA) is commonly used for classification, it may not be directly applied in genomics studies due to the large p, small n problem in these studies. Different versions of sparse LDA have been proposed to address this significant challenge. One implicit assumption of various LDA-based methods is that the covariance matrices are the same across different classes. However, rewiring of genetic networks (therefore different covariance matrices) across different diseases has been observed in many genomics studies, which suggests that LDA and its variations may be suboptimal for disease classifications. However, it is not clear whether considering differing genetic networks across diseases can improve classification in genomics studies. RESULTS: We propose a sparse version of Quadratic Discriminant Analysis (SQDA) to explicitly consider the differences of the genetic networks across diseases. Both simulation and real data analysis are performed to compare the performance of SQDA with six commonly used classification methods. CONCLUSIONS: SQDA provides more accurate classification results than other methods for both simulated and real data. Our method should prove useful for classification in genomics studies and other research settings, where covariances differ among classes.
format Online
Article
Text
id pubmed-4355996
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-43559962015-03-12 The application of sparse estimation of covariance matrix to quadratic discriminant analysis Sun, Jiehuan Zhao, Hongyu BMC Bioinformatics Methodology Article BACKGROUND: Although Linear Discriminant Analysis (LDA) is commonly used for classification, it may not be directly applied in genomics studies due to the large p, small n problem in these studies. Different versions of sparse LDA have been proposed to address this significant challenge. One implicit assumption of various LDA-based methods is that the covariance matrices are the same across different classes. However, rewiring of genetic networks (therefore different covariance matrices) across different diseases has been observed in many genomics studies, which suggests that LDA and its variations may be suboptimal for disease classifications. However, it is not clear whether considering differing genetic networks across diseases can improve classification in genomics studies. RESULTS: We propose a sparse version of Quadratic Discriminant Analysis (SQDA) to explicitly consider the differences of the genetic networks across diseases. Both simulation and real data analysis are performed to compare the performance of SQDA with six commonly used classification methods. CONCLUSIONS: SQDA provides more accurate classification results than other methods for both simulated and real data. Our method should prove useful for classification in genomics studies and other research settings, where covariances differ among classes. BioMed Central 2015-02-18 /pmc/articles/PMC4355996/ /pubmed/25886892 http://dx.doi.org/10.1186/s12859-014-0443-6 Text en © Sun and Zhao; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Sun, Jiehuan
Zhao, Hongyu
The application of sparse estimation of covariance matrix to quadratic discriminant analysis
title The application of sparse estimation of covariance matrix to quadratic discriminant analysis
title_full The application of sparse estimation of covariance matrix to quadratic discriminant analysis
title_fullStr The application of sparse estimation of covariance matrix to quadratic discriminant analysis
title_full_unstemmed The application of sparse estimation of covariance matrix to quadratic discriminant analysis
title_short The application of sparse estimation of covariance matrix to quadratic discriminant analysis
title_sort application of sparse estimation of covariance matrix to quadratic discriminant analysis
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4355996/
https://www.ncbi.nlm.nih.gov/pubmed/25886892
http://dx.doi.org/10.1186/s12859-014-0443-6
work_keys_str_mv AT sunjiehuan theapplicationofsparseestimationofcovariancematrixtoquadraticdiscriminantanalysis
AT zhaohongyu theapplicationofsparseestimationofcovariancematrixtoquadraticdiscriminantanalysis
AT sunjiehuan applicationofsparseestimationofcovariancematrixtoquadraticdiscriminantanalysis
AT zhaohongyu applicationofsparseestimationofcovariancematrixtoquadraticdiscriminantanalysis