Cargando…

GPHMM: an integrated hidden Markov model for identification of copy number alteration and loss of heterozygosity in complex tumor samples using whole genome SNP arrays

There is an increasing interest in using single nucleotide polymorphism (SNP) genotyping arrays for profiling chromosomal rearrangements in tumors, as they allow simultaneous detection of copy number and loss of heterozygosity with high resolution. Critical issues such as signal baseline shift due t...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Ao, Liu, Zongzhi, Lezon-Geyda, Kimberly, Sarkar, Sudipa, Lannin, Donald, Schulz, Vincent, Krop, Ian, Winer, Eric, Harris, Lyndsay, Tuck, David
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3130254/
https://www.ncbi.nlm.nih.gov/pubmed/21398628
http://dx.doi.org/10.1093/nar/gkr014
_version_ 1782207594859855872
author Li, Ao
Liu, Zongzhi
Lezon-Geyda, Kimberly
Sarkar, Sudipa
Lannin, Donald
Schulz, Vincent
Krop, Ian
Winer, Eric
Harris, Lyndsay
Tuck, David
author_facet Li, Ao
Liu, Zongzhi
Lezon-Geyda, Kimberly
Sarkar, Sudipa
Lannin, Donald
Schulz, Vincent
Krop, Ian
Winer, Eric
Harris, Lyndsay
Tuck, David
author_sort Li, Ao
collection PubMed
description There is an increasing interest in using single nucleotide polymorphism (SNP) genotyping arrays for profiling chromosomal rearrangements in tumors, as they allow simultaneous detection of copy number and loss of heterozygosity with high resolution. Critical issues such as signal baseline shift due to aneuploidy, normal cell contamination, and the presence of GC content bias have been reported to dramatically alter SNP array signals and complicate accurate identification of aberrations in cancer genomes. To address these issues, we propose a novel Global Parameter Hidden Markov Model (GPHMM) to unravel tangled genotyping data generated from tumor samples. In contrast to other HMM methods, a distinct feature of GPHMM is that the issues mentioned above are quantitatively modeled by global parameters and integrated within the statistical framework. We developed an efficient EM algorithm for parameter estimation. We evaluated performance on three data sets and show that GPHMM can correctly identify chromosomal aberrations in tumor samples containing as few as 10% cancer cells. Furthermore, we demonstrated that the estimation of global parameters in GPHMM provides information about the biological characteristics of tumor samples and the quality of genotyping signal from SNP array experiments, which is helpful for data quality control and outlier detection in cohort studies.
format Online
Article
Text
id pubmed-3130254
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-31302542011-07-06 GPHMM: an integrated hidden Markov model for identification of copy number alteration and loss of heterozygosity in complex tumor samples using whole genome SNP arrays Li, Ao Liu, Zongzhi Lezon-Geyda, Kimberly Sarkar, Sudipa Lannin, Donald Schulz, Vincent Krop, Ian Winer, Eric Harris, Lyndsay Tuck, David Nucleic Acids Res Computational Biology There is an increasing interest in using single nucleotide polymorphism (SNP) genotyping arrays for profiling chromosomal rearrangements in tumors, as they allow simultaneous detection of copy number and loss of heterozygosity with high resolution. Critical issues such as signal baseline shift due to aneuploidy, normal cell contamination, and the presence of GC content bias have been reported to dramatically alter SNP array signals and complicate accurate identification of aberrations in cancer genomes. To address these issues, we propose a novel Global Parameter Hidden Markov Model (GPHMM) to unravel tangled genotyping data generated from tumor samples. In contrast to other HMM methods, a distinct feature of GPHMM is that the issues mentioned above are quantitatively modeled by global parameters and integrated within the statistical framework. We developed an efficient EM algorithm for parameter estimation. We evaluated performance on three data sets and show that GPHMM can correctly identify chromosomal aberrations in tumor samples containing as few as 10% cancer cells. Furthermore, we demonstrated that the estimation of global parameters in GPHMM provides information about the biological characteristics of tumor samples and the quality of genotyping signal from SNP array experiments, which is helpful for data quality control and outlier detection in cohort studies. Oxford University Press 2011-07 2011-03-11 /pmc/articles/PMC3130254/ /pubmed/21398628 http://dx.doi.org/10.1093/nar/gkr014 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Computational Biology
Li, Ao
Liu, Zongzhi
Lezon-Geyda, Kimberly
Sarkar, Sudipa
Lannin, Donald
Schulz, Vincent
Krop, Ian
Winer, Eric
Harris, Lyndsay
Tuck, David
GPHMM: an integrated hidden Markov model for identification of copy number alteration and loss of heterozygosity in complex tumor samples using whole genome SNP arrays
title GPHMM: an integrated hidden Markov model for identification of copy number alteration and loss of heterozygosity in complex tumor samples using whole genome SNP arrays
title_full GPHMM: an integrated hidden Markov model for identification of copy number alteration and loss of heterozygosity in complex tumor samples using whole genome SNP arrays
title_fullStr GPHMM: an integrated hidden Markov model for identification of copy number alteration and loss of heterozygosity in complex tumor samples using whole genome SNP arrays
title_full_unstemmed GPHMM: an integrated hidden Markov model for identification of copy number alteration and loss of heterozygosity in complex tumor samples using whole genome SNP arrays
title_short GPHMM: an integrated hidden Markov model for identification of copy number alteration and loss of heterozygosity in complex tumor samples using whole genome SNP arrays
title_sort gphmm: an integrated hidden markov model for identification of copy number alteration and loss of heterozygosity in complex tumor samples using whole genome snp arrays
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3130254/
https://www.ncbi.nlm.nih.gov/pubmed/21398628
http://dx.doi.org/10.1093/nar/gkr014
work_keys_str_mv AT liao gphmmanintegratedhiddenmarkovmodelforidentificationofcopynumberalterationandlossofheterozygosityincomplextumorsamplesusingwholegenomesnparrays
AT liuzongzhi gphmmanintegratedhiddenmarkovmodelforidentificationofcopynumberalterationandlossofheterozygosityincomplextumorsamplesusingwholegenomesnparrays
AT lezongeydakimberly gphmmanintegratedhiddenmarkovmodelforidentificationofcopynumberalterationandlossofheterozygosityincomplextumorsamplesusingwholegenomesnparrays
AT sarkarsudipa gphmmanintegratedhiddenmarkovmodelforidentificationofcopynumberalterationandlossofheterozygosityincomplextumorsamplesusingwholegenomesnparrays
AT lannindonald gphmmanintegratedhiddenmarkovmodelforidentificationofcopynumberalterationandlossofheterozygosityincomplextumorsamplesusingwholegenomesnparrays
AT schulzvincent gphmmanintegratedhiddenmarkovmodelforidentificationofcopynumberalterationandlossofheterozygosityincomplextumorsamplesusingwholegenomesnparrays
AT kropian gphmmanintegratedhiddenmarkovmodelforidentificationofcopynumberalterationandlossofheterozygosityincomplextumorsamplesusingwholegenomesnparrays
AT winereric gphmmanintegratedhiddenmarkovmodelforidentificationofcopynumberalterationandlossofheterozygosityincomplextumorsamplesusingwholegenomesnparrays
AT harrislyndsay gphmmanintegratedhiddenmarkovmodelforidentificationofcopynumberalterationandlossofheterozygosityincomplextumorsamplesusingwholegenomesnparrays
AT tuckdavid gphmmanintegratedhiddenmarkovmodelforidentificationofcopynumberalterationandlossofheterozygosityincomplextumorsamplesusingwholegenomesnparrays