Cargando…

MixHMM: Inferring Copy Number Variation and Allelic Imbalance Using SNP Arrays and Tumor Samples Mixed with Stromal Cells

BACKGROUND: Genotyping platforms such as single nucleotide polymorphism (SNP) arrays are powerful tools to study genomic aberrations in cancer samples. Allele specific information from SNP arrays provides valuable information for interpreting copy number variation (CNV) and allelic imbalance includi...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Zongzhi, Li, Ao, Schulz, Vincent, Chen, Min, Tuck, David
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2879364/
https://www.ncbi.nlm.nih.gov/pubmed/20532221
http://dx.doi.org/10.1371/journal.pone.0010909
_version_ 1782181919335645184
author Liu, Zongzhi
Li, Ao
Schulz, Vincent
Chen, Min
Tuck, David
author_facet Liu, Zongzhi
Li, Ao
Schulz, Vincent
Chen, Min
Tuck, David
author_sort Liu, Zongzhi
collection PubMed
description BACKGROUND: Genotyping platforms such as single nucleotide polymorphism (SNP) arrays are powerful tools to study genomic aberrations in cancer samples. Allele specific information from SNP arrays provides valuable information for interpreting copy number variation (CNV) and allelic imbalance including loss-of-heterozygosity (LOH) beyond that obtained from the total DNA signal available from array comparative genomic hybridization (aCGH) platforms. Several algorithms based on hidden Markov models (HMMs) have been designed to detect copy number changes and copy-neutral LOH making use of the allele information on SNP arrays. However heterogeneity in clinical samples, due to stromal contamination and somatic alterations, complicates analysis and interpretation of these data. METHODS: We have developed MixHMM, a novel hidden Markov model using hidden states based on chromosomal structural aberrations. MixHMM allows CNV detection for copy numbers up to 7 and allows more complete and accurate description of other forms of allelic imbalance, such as increased copy number LOH or imbalanced amplifications. MixHMM also incorporates a novel sample mixing model that allows detection of tumor CNV events in heterogeneous tumor samples, where cancer cells are mixed with a proportion of stromal cells. CONCLUSIONS: We validate MixHMM and demonstrate its advantages with simulated samples, clinical tumor samples and a dilution series of mixed samples. We have shown that the CNVs of cancer cells in a tumor sample contaminated with up to 80% of stromal cells can be detected accurately using Illumina BeadChip and MixHMM. AVAILABILITY: The MixHMM is available as a Python package provided with some other useful tools at http://genecube.med.yale.edu:8080/MixHMM.
format Text
id pubmed-2879364
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-28793642010-06-07 MixHMM: Inferring Copy Number Variation and Allelic Imbalance Using SNP Arrays and Tumor Samples Mixed with Stromal Cells Liu, Zongzhi Li, Ao Schulz, Vincent Chen, Min Tuck, David PLoS One Research Article BACKGROUND: Genotyping platforms such as single nucleotide polymorphism (SNP) arrays are powerful tools to study genomic aberrations in cancer samples. Allele specific information from SNP arrays provides valuable information for interpreting copy number variation (CNV) and allelic imbalance including loss-of-heterozygosity (LOH) beyond that obtained from the total DNA signal available from array comparative genomic hybridization (aCGH) platforms. Several algorithms based on hidden Markov models (HMMs) have been designed to detect copy number changes and copy-neutral LOH making use of the allele information on SNP arrays. However heterogeneity in clinical samples, due to stromal contamination and somatic alterations, complicates analysis and interpretation of these data. METHODS: We have developed MixHMM, a novel hidden Markov model using hidden states based on chromosomal structural aberrations. MixHMM allows CNV detection for copy numbers up to 7 and allows more complete and accurate description of other forms of allelic imbalance, such as increased copy number LOH or imbalanced amplifications. MixHMM also incorporates a novel sample mixing model that allows detection of tumor CNV events in heterogeneous tumor samples, where cancer cells are mixed with a proportion of stromal cells. CONCLUSIONS: We validate MixHMM and demonstrate its advantages with simulated samples, clinical tumor samples and a dilution series of mixed samples. We have shown that the CNVs of cancer cells in a tumor sample contaminated with up to 80% of stromal cells can be detected accurately using Illumina BeadChip and MixHMM. AVAILABILITY: The MixHMM is available as a Python package provided with some other useful tools at http://genecube.med.yale.edu:8080/MixHMM. Public Library of Science 2010-06-01 /pmc/articles/PMC2879364/ /pubmed/20532221 http://dx.doi.org/10.1371/journal.pone.0010909 Text en Liu et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Liu, Zongzhi
Li, Ao
Schulz, Vincent
Chen, Min
Tuck, David
MixHMM: Inferring Copy Number Variation and Allelic Imbalance Using SNP Arrays and Tumor Samples Mixed with Stromal Cells
title MixHMM: Inferring Copy Number Variation and Allelic Imbalance Using SNP Arrays and Tumor Samples Mixed with Stromal Cells
title_full MixHMM: Inferring Copy Number Variation and Allelic Imbalance Using SNP Arrays and Tumor Samples Mixed with Stromal Cells
title_fullStr MixHMM: Inferring Copy Number Variation and Allelic Imbalance Using SNP Arrays and Tumor Samples Mixed with Stromal Cells
title_full_unstemmed MixHMM: Inferring Copy Number Variation and Allelic Imbalance Using SNP Arrays and Tumor Samples Mixed with Stromal Cells
title_short MixHMM: Inferring Copy Number Variation and Allelic Imbalance Using SNP Arrays and Tumor Samples Mixed with Stromal Cells
title_sort mixhmm: inferring copy number variation and allelic imbalance using snp arrays and tumor samples mixed with stromal cells
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2879364/
https://www.ncbi.nlm.nih.gov/pubmed/20532221
http://dx.doi.org/10.1371/journal.pone.0010909
work_keys_str_mv AT liuzongzhi mixhmminferringcopynumbervariationandallelicimbalanceusingsnparraysandtumorsamplesmixedwithstromalcells
AT liao mixhmminferringcopynumbervariationandallelicimbalanceusingsnparraysandtumorsamplesmixedwithstromalcells
AT schulzvincent mixhmminferringcopynumbervariationandallelicimbalanceusingsnparraysandtumorsamplesmixedwithstromalcells
AT chenmin mixhmminferringcopynumbervariationandallelicimbalanceusingsnparraysandtumorsamplesmixedwithstromalcells
AT tuckdavid mixhmminferringcopynumbervariationandallelicimbalanceusingsnparraysandtumorsamplesmixedwithstromalcells