Cargando…

Mutation Discovery in Regions of Segmental Cancer Genome Amplifications with CoNAn-SNV: A Mixture Model for Next Generation Sequencing of Tumors

Next generation sequencing has now enabled a cost-effective enumeration of the full mutational complement of a tumor genome—in particular single nucleotide variants (SNVs). Most current computational and statistical models for analyzing next generation sequencing data, however, do not account for ca...

Descripción completa

Detalles Bibliográficos
Autores principales: Crisan, Anamaria, Goya, Rodrigo, Ha, Gavin, Ding, Jiarui, Prentice, Leah M., Oloumi, Arusha, Senz, Janine, Zeng, Thomas, Tse, Kane, Delaney, Allen, Marra, Marco A., Huntsman, David G., Hirst, Martin, Aparicio, Sam, Shah, Sohrab
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3420914/
https://www.ncbi.nlm.nih.gov/pubmed/22916110
http://dx.doi.org/10.1371/journal.pone.0041551
_version_ 1782240942919516160
author Crisan, Anamaria
Goya, Rodrigo
Ha, Gavin
Ding, Jiarui
Prentice, Leah M.
Oloumi, Arusha
Senz, Janine
Zeng, Thomas
Tse, Kane
Delaney, Allen
Marra, Marco A.
Huntsman, David G.
Hirst, Martin
Aparicio, Sam
Shah, Sohrab
author_facet Crisan, Anamaria
Goya, Rodrigo
Ha, Gavin
Ding, Jiarui
Prentice, Leah M.
Oloumi, Arusha
Senz, Janine
Zeng, Thomas
Tse, Kane
Delaney, Allen
Marra, Marco A.
Huntsman, David G.
Hirst, Martin
Aparicio, Sam
Shah, Sohrab
author_sort Crisan, Anamaria
collection PubMed
description Next generation sequencing has now enabled a cost-effective enumeration of the full mutational complement of a tumor genome—in particular single nucleotide variants (SNVs). Most current computational and statistical models for analyzing next generation sequencing data, however, do not account for cancer-specific biological properties, including somatic segmental copy number alterations (CNAs)—which require special treatment of the data. Here we present CoNAn-SNV (Copy Number Annotated SNV): a novel algorithm for the inference of single nucleotide variants (SNVs) that overlap copy number alterations. The method is based on modelling the notion that genomic regions of segmental duplication and amplification induce an extended genotype space where a subset of genotypes will exhibit heavily skewed allelic distributions in SNVs (and therefore render them undetectable by methods that assume diploidy). We introduce the concept of modelling allelic counts from sequencing data using a panel of Binomial mixture models where the number of mixtures for a given locus in the genome is informed by a discrete copy number state given as input. We applied CoNAn-SNV to a previously published whole genome shotgun data set obtained from a lobular breast cancer and show that it is able to discover 21 experimentally revalidated somatic non-synonymous mutations in a lobular breast cancer genome that were not detected using copy number insensitive SNV detection algorithms. Importantly, ROC analysis shows that the increased sensitivity of CoNAn-SNV does not result in disproportionate loss of specificity. This was also supported by analysis of a recently published lymphoma genome with a relatively quiescent karyotype, where CoNAn-SNV showed similar results to other callers except in regions of copy number gain where increased sensitivity was conferred. Our results indicate that in genomically unstable tumors, copy number annotation for SNV detection will be critical to fully characterize the mutational landscape of cancer genomes.
format Online
Article
Text
id pubmed-3420914
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-34209142012-08-22 Mutation Discovery in Regions of Segmental Cancer Genome Amplifications with CoNAn-SNV: A Mixture Model for Next Generation Sequencing of Tumors Crisan, Anamaria Goya, Rodrigo Ha, Gavin Ding, Jiarui Prentice, Leah M. Oloumi, Arusha Senz, Janine Zeng, Thomas Tse, Kane Delaney, Allen Marra, Marco A. Huntsman, David G. Hirst, Martin Aparicio, Sam Shah, Sohrab PLoS One Research Article Next generation sequencing has now enabled a cost-effective enumeration of the full mutational complement of a tumor genome—in particular single nucleotide variants (SNVs). Most current computational and statistical models for analyzing next generation sequencing data, however, do not account for cancer-specific biological properties, including somatic segmental copy number alterations (CNAs)—which require special treatment of the data. Here we present CoNAn-SNV (Copy Number Annotated SNV): a novel algorithm for the inference of single nucleotide variants (SNVs) that overlap copy number alterations. The method is based on modelling the notion that genomic regions of segmental duplication and amplification induce an extended genotype space where a subset of genotypes will exhibit heavily skewed allelic distributions in SNVs (and therefore render them undetectable by methods that assume diploidy). We introduce the concept of modelling allelic counts from sequencing data using a panel of Binomial mixture models where the number of mixtures for a given locus in the genome is informed by a discrete copy number state given as input. We applied CoNAn-SNV to a previously published whole genome shotgun data set obtained from a lobular breast cancer and show that it is able to discover 21 experimentally revalidated somatic non-synonymous mutations in a lobular breast cancer genome that were not detected using copy number insensitive SNV detection algorithms. Importantly, ROC analysis shows that the increased sensitivity of CoNAn-SNV does not result in disproportionate loss of specificity. This was also supported by analysis of a recently published lymphoma genome with a relatively quiescent karyotype, where CoNAn-SNV showed similar results to other callers except in regions of copy number gain where increased sensitivity was conferred. Our results indicate that in genomically unstable tumors, copy number annotation for SNV detection will be critical to fully characterize the mutational landscape of cancer genomes. Public Library of Science 2012-08-16 /pmc/articles/PMC3420914/ /pubmed/22916110 http://dx.doi.org/10.1371/journal.pone.0041551 Text en © 2012 Crisan et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Crisan, Anamaria
Goya, Rodrigo
Ha, Gavin
Ding, Jiarui
Prentice, Leah M.
Oloumi, Arusha
Senz, Janine
Zeng, Thomas
Tse, Kane
Delaney, Allen
Marra, Marco A.
Huntsman, David G.
Hirst, Martin
Aparicio, Sam
Shah, Sohrab
Mutation Discovery in Regions of Segmental Cancer Genome Amplifications with CoNAn-SNV: A Mixture Model for Next Generation Sequencing of Tumors
title Mutation Discovery in Regions of Segmental Cancer Genome Amplifications with CoNAn-SNV: A Mixture Model for Next Generation Sequencing of Tumors
title_full Mutation Discovery in Regions of Segmental Cancer Genome Amplifications with CoNAn-SNV: A Mixture Model for Next Generation Sequencing of Tumors
title_fullStr Mutation Discovery in Regions of Segmental Cancer Genome Amplifications with CoNAn-SNV: A Mixture Model for Next Generation Sequencing of Tumors
title_full_unstemmed Mutation Discovery in Regions of Segmental Cancer Genome Amplifications with CoNAn-SNV: A Mixture Model for Next Generation Sequencing of Tumors
title_short Mutation Discovery in Regions of Segmental Cancer Genome Amplifications with CoNAn-SNV: A Mixture Model for Next Generation Sequencing of Tumors
title_sort mutation discovery in regions of segmental cancer genome amplifications with conan-snv: a mixture model for next generation sequencing of tumors
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3420914/
https://www.ncbi.nlm.nih.gov/pubmed/22916110
http://dx.doi.org/10.1371/journal.pone.0041551
work_keys_str_mv AT crisananamaria mutationdiscoveryinregionsofsegmentalcancergenomeamplificationswithconansnvamixturemodelfornextgenerationsequencingoftumors
AT goyarodrigo mutationdiscoveryinregionsofsegmentalcancergenomeamplificationswithconansnvamixturemodelfornextgenerationsequencingoftumors
AT hagavin mutationdiscoveryinregionsofsegmentalcancergenomeamplificationswithconansnvamixturemodelfornextgenerationsequencingoftumors
AT dingjiarui mutationdiscoveryinregionsofsegmentalcancergenomeamplificationswithconansnvamixturemodelfornextgenerationsequencingoftumors
AT prenticeleahm mutationdiscoveryinregionsofsegmentalcancergenomeamplificationswithconansnvamixturemodelfornextgenerationsequencingoftumors
AT oloumiarusha mutationdiscoveryinregionsofsegmentalcancergenomeamplificationswithconansnvamixturemodelfornextgenerationsequencingoftumors
AT senzjanine mutationdiscoveryinregionsofsegmentalcancergenomeamplificationswithconansnvamixturemodelfornextgenerationsequencingoftumors
AT zengthomas mutationdiscoveryinregionsofsegmentalcancergenomeamplificationswithconansnvamixturemodelfornextgenerationsequencingoftumors
AT tsekane mutationdiscoveryinregionsofsegmentalcancergenomeamplificationswithconansnvamixturemodelfornextgenerationsequencingoftumors
AT delaneyallen mutationdiscoveryinregionsofsegmentalcancergenomeamplificationswithconansnvamixturemodelfornextgenerationsequencingoftumors
AT marramarcoa mutationdiscoveryinregionsofsegmentalcancergenomeamplificationswithconansnvamixturemodelfornextgenerationsequencingoftumors
AT huntsmandavidg mutationdiscoveryinregionsofsegmentalcancergenomeamplificationswithconansnvamixturemodelfornextgenerationsequencingoftumors
AT hirstmartin mutationdiscoveryinregionsofsegmentalcancergenomeamplificationswithconansnvamixturemodelfornextgenerationsequencingoftumors
AT apariciosam mutationdiscoveryinregionsofsegmentalcancergenomeamplificationswithconansnvamixturemodelfornextgenerationsequencingoftumors
AT shahsohrab mutationdiscoveryinregionsofsegmentalcancergenomeamplificationswithconansnvamixturemodelfornextgenerationsequencingoftumors