Cargando…

SV-Bay: structural variant detection in cancer genomes using a Bayesian approach with correction for GC-content and read mappability

Motivation: Whole genome sequencing of paired-end reads can be applied to characterize the landscape of large somatic rearrangements of cancer genomes. Several methods for detecting structural variants with whole genome sequencing data have been developed. So far, none of these methods has combined...

Descripción completa

Detalles Bibliográficos
Autores principales: Iakovishina, Daria, Janoueix-Lerosey, Isabelle, Barillot, Emmanuel, Regnier, Mireille, Boeva, Valentina
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4896370/
https://www.ncbi.nlm.nih.gov/pubmed/26740523
http://dx.doi.org/10.1093/bioinformatics/btv751
_version_ 1782436009691054080
author Iakovishina, Daria
Janoueix-Lerosey, Isabelle
Barillot, Emmanuel
Regnier, Mireille
Boeva, Valentina
author_facet Iakovishina, Daria
Janoueix-Lerosey, Isabelle
Barillot, Emmanuel
Regnier, Mireille
Boeva, Valentina
author_sort Iakovishina, Daria
collection PubMed
description Motivation: Whole genome sequencing of paired-end reads can be applied to characterize the landscape of large somatic rearrangements of cancer genomes. Several methods for detecting structural variants with whole genome sequencing data have been developed. So far, none of these methods has combined information about abnormally mapped read pairs connecting rearranged regions and associated global copy number changes automatically inferred from the same sequencing data file. Our aim was to create a computational method that could use both types of information, i.e. normal and abnormal reads, and demonstrate that by doing so we can highly improve both sensitivity and specificity rates of structural variant prediction. Results: We developed a computational method, SV-Bay, to detect structural variants from whole genome sequencing mate-pair or paired-end data using a probabilistic Bayesian approach. This approach takes into account depth of coverage by normal reads and abnormalities in read pair mappings. To estimate the model likelihood, SV-Bay considers GC-content and read mappability of the genome, thus making important corrections to the expected read count. For the detection of somatic variants, SV-Bay makes use of a matched normal sample when it is available. We validated SV-Bay on simulated datasets and an experimental mate-pair dataset for the CLB-GA neuroblastoma cell line. The comparison of SV-Bay with several other methods for structural variant detection demonstrated that SV-Bay has better prediction accuracy both in terms of sensitivity and false-positive detection rate. Availability and implementation: https://github.com/InstitutCurie/SV-Bay Contact: valentina.boeva@inserm.fr Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-4896370
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-48963702016-06-09 SV-Bay: structural variant detection in cancer genomes using a Bayesian approach with correction for GC-content and read mappability Iakovishina, Daria Janoueix-Lerosey, Isabelle Barillot, Emmanuel Regnier, Mireille Boeva, Valentina Bioinformatics Recomb-Seq/Recomb-Cbb Motivation: Whole genome sequencing of paired-end reads can be applied to characterize the landscape of large somatic rearrangements of cancer genomes. Several methods for detecting structural variants with whole genome sequencing data have been developed. So far, none of these methods has combined information about abnormally mapped read pairs connecting rearranged regions and associated global copy number changes automatically inferred from the same sequencing data file. Our aim was to create a computational method that could use both types of information, i.e. normal and abnormal reads, and demonstrate that by doing so we can highly improve both sensitivity and specificity rates of structural variant prediction. Results: We developed a computational method, SV-Bay, to detect structural variants from whole genome sequencing mate-pair or paired-end data using a probabilistic Bayesian approach. This approach takes into account depth of coverage by normal reads and abnormalities in read pair mappings. To estimate the model likelihood, SV-Bay considers GC-content and read mappability of the genome, thus making important corrections to the expected read count. For the detection of somatic variants, SV-Bay makes use of a matched normal sample when it is available. We validated SV-Bay on simulated datasets and an experimental mate-pair dataset for the CLB-GA neuroblastoma cell line. The comparison of SV-Bay with several other methods for structural variant detection demonstrated that SV-Bay has better prediction accuracy both in terms of sensitivity and false-positive detection rate. Availability and implementation: https://github.com/InstitutCurie/SV-Bay Contact: valentina.boeva@inserm.fr Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2016-04-01 2016-01-06 /pmc/articles/PMC4896370/ /pubmed/26740523 http://dx.doi.org/10.1093/bioinformatics/btv751 Text en © The Author 2016. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Recomb-Seq/Recomb-Cbb
Iakovishina, Daria
Janoueix-Lerosey, Isabelle
Barillot, Emmanuel
Regnier, Mireille
Boeva, Valentina
SV-Bay: structural variant detection in cancer genomes using a Bayesian approach with correction for GC-content and read mappability
title SV-Bay: structural variant detection in cancer genomes using a Bayesian approach with correction for GC-content and read mappability
title_full SV-Bay: structural variant detection in cancer genomes using a Bayesian approach with correction for GC-content and read mappability
title_fullStr SV-Bay: structural variant detection in cancer genomes using a Bayesian approach with correction for GC-content and read mappability
title_full_unstemmed SV-Bay: structural variant detection in cancer genomes using a Bayesian approach with correction for GC-content and read mappability
title_short SV-Bay: structural variant detection in cancer genomes using a Bayesian approach with correction for GC-content and read mappability
title_sort sv-bay: structural variant detection in cancer genomes using a bayesian approach with correction for gc-content and read mappability
topic Recomb-Seq/Recomb-Cbb
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4896370/
https://www.ncbi.nlm.nih.gov/pubmed/26740523
http://dx.doi.org/10.1093/bioinformatics/btv751
work_keys_str_mv AT iakovishinadaria svbaystructuralvariantdetectionincancergenomesusingabayesianapproachwithcorrectionforgccontentandreadmappability
AT janoueixleroseyisabelle svbaystructuralvariantdetectionincancergenomesusingabayesianapproachwithcorrectionforgccontentandreadmappability
AT barillotemmanuel svbaystructuralvariantdetectionincancergenomesusingabayesianapproachwithcorrectionforgccontentandreadmappability
AT regniermireille svbaystructuralvariantdetectionincancergenomesusingabayesianapproachwithcorrectionforgccontentandreadmappability
AT boevavalentina svbaystructuralvariantdetectionincancergenomesusingabayesianapproachwithcorrectionforgccontentandreadmappability