Cargando…
Comparison of Sequencing Based CNV Discovery Methods Using Monozygotic Twin Quartets
BACKGROUND: The advent of high throughput sequencing methods breeds an important amount of technical challenges. Among those is the one raised by the discovery of copy-number variations (CNVs) using whole-genome sequencing data. CNVs are genomic structural variations defined as a variation in the nu...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4374778/ https://www.ncbi.nlm.nih.gov/pubmed/25812131 http://dx.doi.org/10.1371/journal.pone.0122287 |
_version_ | 1782363543999348736 |
---|---|
author | Legault, Marc-André Girard, Simon Lemieux Perreault, Louis-Philippe Rouleau, Guy A. Dubé, Marie-Pierre |
author_facet | Legault, Marc-André Girard, Simon Lemieux Perreault, Louis-Philippe Rouleau, Guy A. Dubé, Marie-Pierre |
author_sort | Legault, Marc-André |
collection | PubMed |
description | BACKGROUND: The advent of high throughput sequencing methods breeds an important amount of technical challenges. Among those is the one raised by the discovery of copy-number variations (CNVs) using whole-genome sequencing data. CNVs are genomic structural variations defined as a variation in the number of copies of a large genomic fragment, usually more than one kilobase. Here, we aim to compare different CNV calling methods in order to assess their ability to consistently identify CNVs by comparison of the calls in 9 quartets of identical twin pairs. The use of monozygotic twins provides a means of estimating the error rate of each algorithm by observing CNVs that are inconsistently called when considering the rules of Mendelian inheritance and the assumption of an identical genome between twins. The similarity between the calls from the different tools and the advantage of combining call sets were also considered. RESULTS: ERDS and CNVnator obtained the best performance when considering the inherited CNV rate with a mean of 0.74 and 0.70, respectively. Venn diagrams were generated to show the agreement between the different algorithms, before and after filtering out familial inconsistencies. This filtering revealed a high number of false positives for CNVer and Breakdancer. A low overall agreement between the methods suggested a high complementarity of the different tools when calling CNVs. The breakpoint sensitivity analysis indicated that CNVnator and ERDS achieved better resolution of CNV borders than the other tools. The highest inherited CNV rate was achieved through the intersection of these two tools (81%). CONCLUSIONS: This study showed that ERDS and CNVnator provide good performance on whole genome sequencing data with respect to CNV consistency across families, CNV breakpoint resolution and CNV call specificity. The intersection of the calls from the two tools would be valuable for CNV genotyping pipelines. |
format | Online Article Text |
id | pubmed-4374778 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-43747782015-04-04 Comparison of Sequencing Based CNV Discovery Methods Using Monozygotic Twin Quartets Legault, Marc-André Girard, Simon Lemieux Perreault, Louis-Philippe Rouleau, Guy A. Dubé, Marie-Pierre PLoS One Research Article BACKGROUND: The advent of high throughput sequencing methods breeds an important amount of technical challenges. Among those is the one raised by the discovery of copy-number variations (CNVs) using whole-genome sequencing data. CNVs are genomic structural variations defined as a variation in the number of copies of a large genomic fragment, usually more than one kilobase. Here, we aim to compare different CNV calling methods in order to assess their ability to consistently identify CNVs by comparison of the calls in 9 quartets of identical twin pairs. The use of monozygotic twins provides a means of estimating the error rate of each algorithm by observing CNVs that are inconsistently called when considering the rules of Mendelian inheritance and the assumption of an identical genome between twins. The similarity between the calls from the different tools and the advantage of combining call sets were also considered. RESULTS: ERDS and CNVnator obtained the best performance when considering the inherited CNV rate with a mean of 0.74 and 0.70, respectively. Venn diagrams were generated to show the agreement between the different algorithms, before and after filtering out familial inconsistencies. This filtering revealed a high number of false positives for CNVer and Breakdancer. A low overall agreement between the methods suggested a high complementarity of the different tools when calling CNVs. The breakpoint sensitivity analysis indicated that CNVnator and ERDS achieved better resolution of CNV borders than the other tools. The highest inherited CNV rate was achieved through the intersection of these two tools (81%). CONCLUSIONS: This study showed that ERDS and CNVnator provide good performance on whole genome sequencing data with respect to CNV consistency across families, CNV breakpoint resolution and CNV call specificity. The intersection of the calls from the two tools would be valuable for CNV genotyping pipelines. Public Library of Science 2015-03-26 /pmc/articles/PMC4374778/ /pubmed/25812131 http://dx.doi.org/10.1371/journal.pone.0122287 Text en © 2015 Legault et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Legault, Marc-André Girard, Simon Lemieux Perreault, Louis-Philippe Rouleau, Guy A. Dubé, Marie-Pierre Comparison of Sequencing Based CNV Discovery Methods Using Monozygotic Twin Quartets |
title | Comparison of Sequencing Based CNV Discovery Methods Using Monozygotic Twin Quartets |
title_full | Comparison of Sequencing Based CNV Discovery Methods Using Monozygotic Twin Quartets |
title_fullStr | Comparison of Sequencing Based CNV Discovery Methods Using Monozygotic Twin Quartets |
title_full_unstemmed | Comparison of Sequencing Based CNV Discovery Methods Using Monozygotic Twin Quartets |
title_short | Comparison of Sequencing Based CNV Discovery Methods Using Monozygotic Twin Quartets |
title_sort | comparison of sequencing based cnv discovery methods using monozygotic twin quartets |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4374778/ https://www.ncbi.nlm.nih.gov/pubmed/25812131 http://dx.doi.org/10.1371/journal.pone.0122287 |
work_keys_str_mv | AT legaultmarcandre comparisonofsequencingbasedcnvdiscoverymethodsusingmonozygotictwinquartets AT girardsimon comparisonofsequencingbasedcnvdiscoverymethodsusingmonozygotictwinquartets AT lemieuxperreaultlouisphilippe comparisonofsequencingbasedcnvdiscoverymethodsusingmonozygotictwinquartets AT rouleauguya comparisonofsequencingbasedcnvdiscoverymethodsusingmonozygotictwinquartets AT dubemariepierre comparisonofsequencingbasedcnvdiscoverymethodsusingmonozygotictwinquartets |