Cargando…
SED, a normalization free method for DNA microarray data analysis
BACKGROUND: Analysis of DNA microarray data usually begins with a normalization step where intensities of different arrays are adjusted to the same scale so that the intensity levels from different arrays can be compared with one other. Both simple total array intensity-based as well as more complex...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2004
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC517708/ https://www.ncbi.nlm.nih.gov/pubmed/15345033 http://dx.doi.org/10.1186/1471-2105-5-121 |
_version_ | 1782121784360828928 |
---|---|
author | Wang, Huajun Huang, Hui |
author_facet | Wang, Huajun Huang, Hui |
author_sort | Wang, Huajun |
collection | PubMed |
description | BACKGROUND: Analysis of DNA microarray data usually begins with a normalization step where intensities of different arrays are adjusted to the same scale so that the intensity levels from different arrays can be compared with one other. Both simple total array intensity-based as well as more complex "local intensity level" dependent normalization methods have been developed, some of which are widely used. Much less developed methods for microarray data analysis include those that bypass the normalization step and therefore yield results that are not confounded by potential normalization errors. RESULTS: Instead of focusing on the raw intensity levels, we developed a new method for microarray data analysis that maps each gene's expression intensity level to a high dimensional space of SEDs (Signs of Expression Difference), the signs of the expression intensity difference between a given gene and every other gene on the array. Since SED are unchanged under any monotonic transformation of intensity levels, the SED based method is normalization free. When tested on a multi-class tumor classification problem, simple Naive Bayes and Nearest Neighbor methods using the SED approach gave results comparable with normalized intensity-based algorithms. Furthermore, a high percentage of classifiers based on a single gene's SED gave good classification results, suggesting that SED does capture essential information from the intensity levels. CONCLUSION: The results of testing this new method on multi-class tumor classification problems suggests that the SED-based, normalization-free method of microarray data analysis is feasible and promising. |
format | Text |
id | pubmed-517708 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2004 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-5177082004-09-19 SED, a normalization free method for DNA microarray data analysis Wang, Huajun Huang, Hui BMC Bioinformatics Research Article BACKGROUND: Analysis of DNA microarray data usually begins with a normalization step where intensities of different arrays are adjusted to the same scale so that the intensity levels from different arrays can be compared with one other. Both simple total array intensity-based as well as more complex "local intensity level" dependent normalization methods have been developed, some of which are widely used. Much less developed methods for microarray data analysis include those that bypass the normalization step and therefore yield results that are not confounded by potential normalization errors. RESULTS: Instead of focusing on the raw intensity levels, we developed a new method for microarray data analysis that maps each gene's expression intensity level to a high dimensional space of SEDs (Signs of Expression Difference), the signs of the expression intensity difference between a given gene and every other gene on the array. Since SED are unchanged under any monotonic transformation of intensity levels, the SED based method is normalization free. When tested on a multi-class tumor classification problem, simple Naive Bayes and Nearest Neighbor methods using the SED approach gave results comparable with normalized intensity-based algorithms. Furthermore, a high percentage of classifiers based on a single gene's SED gave good classification results, suggesting that SED does capture essential information from the intensity levels. CONCLUSION: The results of testing this new method on multi-class tumor classification problems suggests that the SED-based, normalization-free method of microarray data analysis is feasible and promising. BioMed Central 2004-09-02 /pmc/articles/PMC517708/ /pubmed/15345033 http://dx.doi.org/10.1186/1471-2105-5-121 Text en Copyright © 2004 Wang and Huang; licensee BioMed Central Ltd. |
spellingShingle | Research Article Wang, Huajun Huang, Hui SED, a normalization free method for DNA microarray data analysis |
title | SED, a normalization free method for DNA microarray data analysis |
title_full | SED, a normalization free method for DNA microarray data analysis |
title_fullStr | SED, a normalization free method for DNA microarray data analysis |
title_full_unstemmed | SED, a normalization free method for DNA microarray data analysis |
title_short | SED, a normalization free method for DNA microarray data analysis |
title_sort | sed, a normalization free method for dna microarray data analysis |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC517708/ https://www.ncbi.nlm.nih.gov/pubmed/15345033 http://dx.doi.org/10.1186/1471-2105-5-121 |
work_keys_str_mv | AT wanghuajun sedanormalizationfreemethodfordnamicroarraydataanalysis AT huanghui sedanormalizationfreemethodfordnamicroarraydataanalysis |