Cargando…

Detecting differential copy number variation between groups of samples

We present a method to detect copy number variants (CNVs) that are differentially present between two groups of sequenced samples. We use a finite-state transducer where the emitted read depth is conditioned on the mappability and GC-content of all reads that occur at a given base position. In this...

Descripción completa

Detalles Bibliográficos
Autores principales: Lowe, Craig B., Sanchez-Luege, Nicelio, Howes, Timothy R., Brady, Shannon D., Daugherty, Rhea R., Jones, Felicity C., Bell, Michael A., Kingsley, David M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5793789/
https://www.ncbi.nlm.nih.gov/pubmed/29229672
http://dx.doi.org/10.1101/gr.206938.116
_version_ 1783297023179489280
author Lowe, Craig B.
Sanchez-Luege, Nicelio
Howes, Timothy R.
Brady, Shannon D.
Daugherty, Rhea R.
Jones, Felicity C.
Bell, Michael A.
Kingsley, David M.
author_facet Lowe, Craig B.
Sanchez-Luege, Nicelio
Howes, Timothy R.
Brady, Shannon D.
Daugherty, Rhea R.
Jones, Felicity C.
Bell, Michael A.
Kingsley, David M.
author_sort Lowe, Craig B.
collection PubMed
description We present a method to detect copy number variants (CNVs) that are differentially present between two groups of sequenced samples. We use a finite-state transducer where the emitted read depth is conditioned on the mappability and GC-content of all reads that occur at a given base position. In this model, the read depth within a region is a mixture of binomials, which in simulations matches the read depth more closely than the often-used negative binomial distribution. The method analyzes all samples simultaneously, preserving uncertainty as to the breakpoints and magnitude of CNVs present in an individual when it identifies CNVs differentially present between the two groups. We apply this method to identify CNVs that are recurrently associated with postglacial adaptation of marine threespine stickleback (Gasterosteus aculeatus) to freshwater. We identify 6664 regions of the stickleback genome, totaling 1.7 Mbp, which show consistent copy number differences between marine and freshwater populations. These deletions and duplications affect both protein-coding genes and cis-regulatory elements, including a noncoding intronic telencephalon enhancer of DCHS1. The functions of the genes near or included within the 6664 CNVs are enriched for immunity and muscle development, as well as head and limb morphology. Although freshwater stickleback have repeatedly evolved from marine populations, we show that freshwater stickleback also act as reservoirs for ancient ancestral sequences that are highly conserved among distantly related teleosts, but largely missing from marine stickleback due to recent selective sweeps in marine populations.
format Online
Article
Text
id pubmed-5793789
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-57937892018-08-01 Detecting differential copy number variation between groups of samples Lowe, Craig B. Sanchez-Luege, Nicelio Howes, Timothy R. Brady, Shannon D. Daugherty, Rhea R. Jones, Felicity C. Bell, Michael A. Kingsley, David M. Genome Res Method We present a method to detect copy number variants (CNVs) that are differentially present between two groups of sequenced samples. We use a finite-state transducer where the emitted read depth is conditioned on the mappability and GC-content of all reads that occur at a given base position. In this model, the read depth within a region is a mixture of binomials, which in simulations matches the read depth more closely than the often-used negative binomial distribution. The method analyzes all samples simultaneously, preserving uncertainty as to the breakpoints and magnitude of CNVs present in an individual when it identifies CNVs differentially present between the two groups. We apply this method to identify CNVs that are recurrently associated with postglacial adaptation of marine threespine stickleback (Gasterosteus aculeatus) to freshwater. We identify 6664 regions of the stickleback genome, totaling 1.7 Mbp, which show consistent copy number differences between marine and freshwater populations. These deletions and duplications affect both protein-coding genes and cis-regulatory elements, including a noncoding intronic telencephalon enhancer of DCHS1. The functions of the genes near or included within the 6664 CNVs are enriched for immunity and muscle development, as well as head and limb morphology. Although freshwater stickleback have repeatedly evolved from marine populations, we show that freshwater stickleback also act as reservoirs for ancient ancestral sequences that are highly conserved among distantly related teleosts, but largely missing from marine stickleback due to recent selective sweeps in marine populations. Cold Spring Harbor Laboratory Press 2018-02 /pmc/articles/PMC5793789/ /pubmed/29229672 http://dx.doi.org/10.1101/gr.206938.116 Text en © 2018 Lowe et al.; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.
spellingShingle Method
Lowe, Craig B.
Sanchez-Luege, Nicelio
Howes, Timothy R.
Brady, Shannon D.
Daugherty, Rhea R.
Jones, Felicity C.
Bell, Michael A.
Kingsley, David M.
Detecting differential copy number variation between groups of samples
title Detecting differential copy number variation between groups of samples
title_full Detecting differential copy number variation between groups of samples
title_fullStr Detecting differential copy number variation between groups of samples
title_full_unstemmed Detecting differential copy number variation between groups of samples
title_short Detecting differential copy number variation between groups of samples
title_sort detecting differential copy number variation between groups of samples
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5793789/
https://www.ncbi.nlm.nih.gov/pubmed/29229672
http://dx.doi.org/10.1101/gr.206938.116
work_keys_str_mv AT lowecraigb detectingdifferentialcopynumbervariationbetweengroupsofsamples
AT sanchezluegenicelio detectingdifferentialcopynumbervariationbetweengroupsofsamples
AT howestimothyr detectingdifferentialcopynumbervariationbetweengroupsofsamples
AT bradyshannond detectingdifferentialcopynumbervariationbetweengroupsofsamples
AT daughertyrhear detectingdifferentialcopynumbervariationbetweengroupsofsamples
AT jonesfelicityc detectingdifferentialcopynumbervariationbetweengroupsofsamples
AT bellmichaela detectingdifferentialcopynumbervariationbetweengroupsofsamples
AT kingsleydavidm detectingdifferentialcopynumbervariationbetweengroupsofsamples