Cargando…

SED, a normalization free method for DNA microarray data analysis

BACKGROUND: Analysis of DNA microarray data usually begins with a normalization step where intensities of different arrays are adjusted to the same scale so that the intensity levels from different arrays can be compared with one other. Both simple total array intensity-based as well as more complex...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Huajun, Huang, Hui
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2004
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC517708/
https://www.ncbi.nlm.nih.gov/pubmed/15345033
http://dx.doi.org/10.1186/1471-2105-5-121
_version_ 1782121784360828928
author Wang, Huajun
Huang, Hui
author_facet Wang, Huajun
Huang, Hui
author_sort Wang, Huajun
collection PubMed
description BACKGROUND: Analysis of DNA microarray data usually begins with a normalization step where intensities of different arrays are adjusted to the same scale so that the intensity levels from different arrays can be compared with one other. Both simple total array intensity-based as well as more complex "local intensity level" dependent normalization methods have been developed, some of which are widely used. Much less developed methods for microarray data analysis include those that bypass the normalization step and therefore yield results that are not confounded by potential normalization errors. RESULTS: Instead of focusing on the raw intensity levels, we developed a new method for microarray data analysis that maps each gene's expression intensity level to a high dimensional space of SEDs (Signs of Expression Difference), the signs of the expression intensity difference between a given gene and every other gene on the array. Since SED are unchanged under any monotonic transformation of intensity levels, the SED based method is normalization free. When tested on a multi-class tumor classification problem, simple Naive Bayes and Nearest Neighbor methods using the SED approach gave results comparable with normalized intensity-based algorithms. Furthermore, a high percentage of classifiers based on a single gene's SED gave good classification results, suggesting that SED does capture essential information from the intensity levels. CONCLUSION: The results of testing this new method on multi-class tumor classification problems suggests that the SED-based, normalization-free method of microarray data analysis is feasible and promising.
format Text
id pubmed-517708
institution National Center for Biotechnology Information
language English
publishDate 2004
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-5177082004-09-19 SED, a normalization free method for DNA microarray data analysis Wang, Huajun Huang, Hui BMC Bioinformatics Research Article BACKGROUND: Analysis of DNA microarray data usually begins with a normalization step where intensities of different arrays are adjusted to the same scale so that the intensity levels from different arrays can be compared with one other. Both simple total array intensity-based as well as more complex "local intensity level" dependent normalization methods have been developed, some of which are widely used. Much less developed methods for microarray data analysis include those that bypass the normalization step and therefore yield results that are not confounded by potential normalization errors. RESULTS: Instead of focusing on the raw intensity levels, we developed a new method for microarray data analysis that maps each gene's expression intensity level to a high dimensional space of SEDs (Signs of Expression Difference), the signs of the expression intensity difference between a given gene and every other gene on the array. Since SED are unchanged under any monotonic transformation of intensity levels, the SED based method is normalization free. When tested on a multi-class tumor classification problem, simple Naive Bayes and Nearest Neighbor methods using the SED approach gave results comparable with normalized intensity-based algorithms. Furthermore, a high percentage of classifiers based on a single gene's SED gave good classification results, suggesting that SED does capture essential information from the intensity levels. CONCLUSION: The results of testing this new method on multi-class tumor classification problems suggests that the SED-based, normalization-free method of microarray data analysis is feasible and promising. BioMed Central 2004-09-02 /pmc/articles/PMC517708/ /pubmed/15345033 http://dx.doi.org/10.1186/1471-2105-5-121 Text en Copyright © 2004 Wang and Huang; licensee BioMed Central Ltd.
spellingShingle Research Article
Wang, Huajun
Huang, Hui
SED, a normalization free method for DNA microarray data analysis
title SED, a normalization free method for DNA microarray data analysis
title_full SED, a normalization free method for DNA microarray data analysis
title_fullStr SED, a normalization free method for DNA microarray data analysis
title_full_unstemmed SED, a normalization free method for DNA microarray data analysis
title_short SED, a normalization free method for DNA microarray data analysis
title_sort sed, a normalization free method for dna microarray data analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC517708/
https://www.ncbi.nlm.nih.gov/pubmed/15345033
http://dx.doi.org/10.1186/1471-2105-5-121
work_keys_str_mv AT wanghuajun sedanormalizationfreemethodfordnamicroarraydataanalysis
AT huanghui sedanormalizationfreemethodfordnamicroarraydataanalysis