Cargando…

A novel approach to detect hot-spots in large-scale multivariate data

BACKGROUND: Progressive advances in the measurement of complex multifactorial components of biological processes involving both spatial and temporal domains have made it difficult to identify the variables (genes, proteins, neurons etc.) significantly changed activities in response to a stimulus wit...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Jianhua, Kendrick, Keith M, Feng, Jianfeng
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2045117/
https://www.ncbi.nlm.nih.gov/pubmed/17848185
http://dx.doi.org/10.1186/1471-2105-8-331
_version_ 1782137137397760000
author Wu, Jianhua
Kendrick, Keith M
Feng, Jianfeng
author_facet Wu, Jianhua
Kendrick, Keith M
Feng, Jianfeng
author_sort Wu, Jianhua
collection PubMed
description BACKGROUND: Progressive advances in the measurement of complex multifactorial components of biological processes involving both spatial and temporal domains have made it difficult to identify the variables (genes, proteins, neurons etc.) significantly changed activities in response to a stimulus within large data sets using conventional statistical approaches. The set of all changed variables is termed hot-spots. The detection of such hot spots is considered to be an NP hard problem, but by first establishing its theoretical foundation we have been able to develop an algorithm that provides a solution. RESULTS: Our results show that a first-order phase transition is observable whose critical point separates the hot-spot set from the remaining variables. Its application is also found to be more successful than existing approaches in identifying statistically significant hot-spots both with simulated data sets and in real large-scale multivariate data sets from gene arrays, electrophysiological recording and functional magnetic resonance imaging experiments. CONCLUSION: In summary, this new statistical algorithm should provide a powerful new analytical tool to extract the maximum information from complex biological multivariate data.
format Text
id pubmed-2045117
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-20451172007-10-31 A novel approach to detect hot-spots in large-scale multivariate data Wu, Jianhua Kendrick, Keith M Feng, Jianfeng BMC Bioinformatics Methodology Article BACKGROUND: Progressive advances in the measurement of complex multifactorial components of biological processes involving both spatial and temporal domains have made it difficult to identify the variables (genes, proteins, neurons etc.) significantly changed activities in response to a stimulus within large data sets using conventional statistical approaches. The set of all changed variables is termed hot-spots. The detection of such hot spots is considered to be an NP hard problem, but by first establishing its theoretical foundation we have been able to develop an algorithm that provides a solution. RESULTS: Our results show that a first-order phase transition is observable whose critical point separates the hot-spot set from the remaining variables. Its application is also found to be more successful than existing approaches in identifying statistically significant hot-spots both with simulated data sets and in real large-scale multivariate data sets from gene arrays, electrophysiological recording and functional magnetic resonance imaging experiments. CONCLUSION: In summary, this new statistical algorithm should provide a powerful new analytical tool to extract the maximum information from complex biological multivariate data. BioMed Central 2007-09-11 /pmc/articles/PMC2045117/ /pubmed/17848185 http://dx.doi.org/10.1186/1471-2105-8-331 Text en Copyright © 2007 Wu et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Wu, Jianhua
Kendrick, Keith M
Feng, Jianfeng
A novel approach to detect hot-spots in large-scale multivariate data
title A novel approach to detect hot-spots in large-scale multivariate data
title_full A novel approach to detect hot-spots in large-scale multivariate data
title_fullStr A novel approach to detect hot-spots in large-scale multivariate data
title_full_unstemmed A novel approach to detect hot-spots in large-scale multivariate data
title_short A novel approach to detect hot-spots in large-scale multivariate data
title_sort novel approach to detect hot-spots in large-scale multivariate data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2045117/
https://www.ncbi.nlm.nih.gov/pubmed/17848185
http://dx.doi.org/10.1186/1471-2105-8-331
work_keys_str_mv AT wujianhua anovelapproachtodetecthotspotsinlargescalemultivariatedata
AT kendrickkeithm anovelapproachtodetecthotspotsinlargescalemultivariatedata
AT fengjianfeng anovelapproachtodetecthotspotsinlargescalemultivariatedata
AT wujianhua novelapproachtodetecthotspotsinlargescalemultivariatedata
AT kendrickkeithm novelapproachtodetecthotspotsinlargescalemultivariatedata
AT fengjianfeng novelapproachtodetecthotspotsinlargescalemultivariatedata