Cargando…

Small Sample Issues for Microarray-Based Classification

In order to study the molecular biological differences between normal and diseased tissues, it is desirable to perform classification among diseases and stages of disease using microarray-based gene-expression values. Owing to the limited number of microarrays typically used in these studies, seriou...

Descripción completa

Detalles Bibliográficos
Autor principal: Dougherty, Edward R.
Formato: Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2001
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2447190/
https://www.ncbi.nlm.nih.gov/pubmed/18628896
http://dx.doi.org/10.1002/cfg.62
_version_ 1782156878652899328
author Dougherty, Edward R.
author_facet Dougherty, Edward R.
author_sort Dougherty, Edward R.
collection PubMed
description In order to study the molecular biological differences between normal and diseased tissues, it is desirable to perform classification among diseases and stages of disease using microarray-based gene-expression values. Owing to the limited number of microarrays typically used in these studies, serious issues arise with respect to the design, performance and analysis of classifiers based on microarray data. This paper reviews some fundamental issues facing small-sample classification: classification rules, constrained classifiers, error estimation and feature selection. It discusses both unconstrained and constrained classifier design from sample data, and the contributions to classifier error from constrained optimization and lack of optimality owing to design from sample data. The difficulty with estimating classifier error when confined to small samples is addressed, particularly estimating the error from training data. The impact of small samples on the ability to include more than a few variables as classifier features is explained.
format Text
id pubmed-2447190
institution National Center for Biotechnology Information
language English
publishDate 2001
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-24471902008-07-14 Small Sample Issues for Microarray-Based Classification Dougherty, Edward R. Comp Funct Genomics Research Article In order to study the molecular biological differences between normal and diseased tissues, it is desirable to perform classification among diseases and stages of disease using microarray-based gene-expression values. Owing to the limited number of microarrays typically used in these studies, serious issues arise with respect to the design, performance and analysis of classifiers based on microarray data. This paper reviews some fundamental issues facing small-sample classification: classification rules, constrained classifiers, error estimation and feature selection. It discusses both unconstrained and constrained classifier design from sample data, and the contributions to classifier error from constrained optimization and lack of optimality owing to design from sample data. The difficulty with estimating classifier error when confined to small samples is addressed, particularly estimating the error from training data. The impact of small samples on the ability to include more than a few variables as classifier features is explained. Hindawi Publishing Corporation 2001-02 /pmc/articles/PMC2447190/ /pubmed/18628896 http://dx.doi.org/10.1002/cfg.62 Text en Copyright © 2001 Hindawi Publishing Corporation. http://creativecommons.org/licenses/by/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Dougherty, Edward R.
Small Sample Issues for Microarray-Based Classification
title Small Sample Issues for Microarray-Based Classification
title_full Small Sample Issues for Microarray-Based Classification
title_fullStr Small Sample Issues for Microarray-Based Classification
title_full_unstemmed Small Sample Issues for Microarray-Based Classification
title_short Small Sample Issues for Microarray-Based Classification
title_sort small sample issues for microarray-based classification
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2447190/
https://www.ncbi.nlm.nih.gov/pubmed/18628896
http://dx.doi.org/10.1002/cfg.62
work_keys_str_mv AT doughertyedwardr smallsampleissuesformicroarraybasedclassification