Cargando…

A hidden Markov model-based algorithm for identifying tumour subtype using array CGH data

BACKGROUND: The recent advancement in array CGH (aCGH) research has significantly improved tumor identification using DNA copy number data. A number of unsupervised learning methods have been proposed for clustering aCGH samples. Two of the major challenges for developing aCGH sample clustering are...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Ke, Yang, Yi, Devanarayan, Viswanath, Xie, Linglin, Deng, Youping, Donald, Sens
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287492/
https://www.ncbi.nlm.nih.gov/pubmed/22369459
http://dx.doi.org/10.1186/1471-2164-12-S5-S10
_version_ 1782224675519070208
author Zhang, Ke
Yang, Yi
Devanarayan, Viswanath
Xie, Linglin
Deng, Youping
Donald, Sens
author_facet Zhang, Ke
Yang, Yi
Devanarayan, Viswanath
Xie, Linglin
Deng, Youping
Donald, Sens
author_sort Zhang, Ke
collection PubMed
description BACKGROUND: The recent advancement in array CGH (aCGH) research has significantly improved tumor identification using DNA copy number data. A number of unsupervised learning methods have been proposed for clustering aCGH samples. Two of the major challenges for developing aCGH sample clustering are the high spatial correlation between aCGH markers and the low computing efficiency. A mixture hidden Markov model based algorithm was developed to address these two challenges. RESULTS: The hidden Markov model (HMM) was used to model the spatial correlation between aCGH markers. A fast clustering algorithm was implemented and real data analysis on glioma aCGH data has shown that it converges to the optimal cluster rapidly and the computation time is proportional to the sample size. Simulation results showed that this HMM based clustering (HMMC) method has a substantially lower error rate than NMF clustering. The HMMC results for glioma data were significantly associated with clinical outcomes. CONCLUSIONS: We have developed a fast clustering algorithm to identify tumor subtypes based on DNA copy number aberrations. The performance of the proposed HMMC method has been evaluated using both simulated and real aCGH data. The software for HMMC in both R and C++ is available in ND INBRE website http://ndinbre.org/programs/bioinformatics.php.
format Online
Article
Text
id pubmed-3287492
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32874922012-03-01 A hidden Markov model-based algorithm for identifying tumour subtype using array CGH data Zhang, Ke Yang, Yi Devanarayan, Viswanath Xie, Linglin Deng, Youping Donald, Sens BMC Genomics Research Article BACKGROUND: The recent advancement in array CGH (aCGH) research has significantly improved tumor identification using DNA copy number data. A number of unsupervised learning methods have been proposed for clustering aCGH samples. Two of the major challenges for developing aCGH sample clustering are the high spatial correlation between aCGH markers and the low computing efficiency. A mixture hidden Markov model based algorithm was developed to address these two challenges. RESULTS: The hidden Markov model (HMM) was used to model the spatial correlation between aCGH markers. A fast clustering algorithm was implemented and real data analysis on glioma aCGH data has shown that it converges to the optimal cluster rapidly and the computation time is proportional to the sample size. Simulation results showed that this HMM based clustering (HMMC) method has a substantially lower error rate than NMF clustering. The HMMC results for glioma data were significantly associated with clinical outcomes. CONCLUSIONS: We have developed a fast clustering algorithm to identify tumor subtypes based on DNA copy number aberrations. The performance of the proposed HMMC method has been evaluated using both simulated and real aCGH data. The software for HMMC in both R and C++ is available in ND INBRE website http://ndinbre.org/programs/bioinformatics.php. BioMed Central 2011-12-23 /pmc/articles/PMC3287492/ /pubmed/22369459 http://dx.doi.org/10.1186/1471-2164-12-S5-S10 Text en Copyright ©2011 Zhang et al. licensee BioMed Central Ltd http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Zhang, Ke
Yang, Yi
Devanarayan, Viswanath
Xie, Linglin
Deng, Youping
Donald, Sens
A hidden Markov model-based algorithm for identifying tumour subtype using array CGH data
title A hidden Markov model-based algorithm for identifying tumour subtype using array CGH data
title_full A hidden Markov model-based algorithm for identifying tumour subtype using array CGH data
title_fullStr A hidden Markov model-based algorithm for identifying tumour subtype using array CGH data
title_full_unstemmed A hidden Markov model-based algorithm for identifying tumour subtype using array CGH data
title_short A hidden Markov model-based algorithm for identifying tumour subtype using array CGH data
title_sort hidden markov model-based algorithm for identifying tumour subtype using array cgh data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287492/
https://www.ncbi.nlm.nih.gov/pubmed/22369459
http://dx.doi.org/10.1186/1471-2164-12-S5-S10
work_keys_str_mv AT zhangke ahiddenmarkovmodelbasedalgorithmforidentifyingtumoursubtypeusingarraycghdata
AT yangyi ahiddenmarkovmodelbasedalgorithmforidentifyingtumoursubtypeusingarraycghdata
AT devanarayanviswanath ahiddenmarkovmodelbasedalgorithmforidentifyingtumoursubtypeusingarraycghdata
AT xielinglin ahiddenmarkovmodelbasedalgorithmforidentifyingtumoursubtypeusingarraycghdata
AT dengyouping ahiddenmarkovmodelbasedalgorithmforidentifyingtumoursubtypeusingarraycghdata
AT donaldsens ahiddenmarkovmodelbasedalgorithmforidentifyingtumoursubtypeusingarraycghdata
AT zhangke hiddenmarkovmodelbasedalgorithmforidentifyingtumoursubtypeusingarraycghdata
AT yangyi hiddenmarkovmodelbasedalgorithmforidentifyingtumoursubtypeusingarraycghdata
AT devanarayanviswanath hiddenmarkovmodelbasedalgorithmforidentifyingtumoursubtypeusingarraycghdata
AT xielinglin hiddenmarkovmodelbasedalgorithmforidentifyingtumoursubtypeusingarraycghdata
AT dengyouping hiddenmarkovmodelbasedalgorithmforidentifyingtumoursubtypeusingarraycghdata
AT donaldsens hiddenmarkovmodelbasedalgorithmforidentifyingtumoursubtypeusingarraycghdata