Cargando…

PSCC: Sensitive and Reliable Population-Scale Copy Number Variation Detection Method Based on Low Coverage Sequencing

BACKGROUND: Copy number variations (CNVs) represent an important type of genetic variation that deeply impact phenotypic polymorphisms and human diseases. The advent of high-throughput sequencing technologies provides an opportunity to revolutionize the discovery of CNVs and to explore their relatio...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Xuchao, Chen, Shengpei, Xie, Weiwei, Vogel, Ida, Choy, Kwong Wai, Chen, Fang, Christensen, Rikke, Zhang, Chunlei, Ge, Huijuan, Jiang, Haojun, Yu, Chang, Huang, Fang, Wang, Wei, Jiang, Hui, Zhang, Xiuqing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3897425/
https://www.ncbi.nlm.nih.gov/pubmed/24465483
http://dx.doi.org/10.1371/journal.pone.0085096
_version_ 1782300231020314624
author Li, Xuchao
Chen, Shengpei
Xie, Weiwei
Vogel, Ida
Choy, Kwong Wai
Chen, Fang
Christensen, Rikke
Zhang, Chunlei
Ge, Huijuan
Jiang, Haojun
Yu, Chang
Huang, Fang
Wang, Wei
Jiang, Hui
Zhang, Xiuqing
author_facet Li, Xuchao
Chen, Shengpei
Xie, Weiwei
Vogel, Ida
Choy, Kwong Wai
Chen, Fang
Christensen, Rikke
Zhang, Chunlei
Ge, Huijuan
Jiang, Haojun
Yu, Chang
Huang, Fang
Wang, Wei
Jiang, Hui
Zhang, Xiuqing
author_sort Li, Xuchao
collection PubMed
description BACKGROUND: Copy number variations (CNVs) represent an important type of genetic variation that deeply impact phenotypic polymorphisms and human diseases. The advent of high-throughput sequencing technologies provides an opportunity to revolutionize the discovery of CNVs and to explore their relationship with diseases. However, most of the existing methods depend on sequencing depth and show instability with low sequence coverage. In this study, using low coverage whole-genome sequencing (LCS) we have developed an effective population-scale CNV calling (PSCC) method. METHODOLOGY/PRINCIPAL FINDINGS: In our novel method, two-step correction was used to remove biases caused by local GC content and complex genomic characteristics. We chose a binary segmentation method to locate CNV segments and designed combined statistics tests to ensure the stable performance of the false positive control. The simulation data showed that our PSCC method could achieve 99.7%/100% and 98.6%/100% sensitivity and specificity for over 300 kb CNV calling in the condition of LCS (∼2×) and ultra LCS (∼0.2×), respectively. Finally, we applied this novel method to analyze 34 clinical samples with an average of 2× LCS. In the final results, all the 31 pathogenic CNVs identified by aCGH were successfully detected. In addition, the performance comparison revealed that our method had significant advantages over existing methods using ultra LCS. CONCLUSIONS/SIGNIFICANCE: Our study showed that PSCC can sensitively and reliably detect CNVs using low coverage or even ultra-low coverage data through population-scale sequencing.
format Online
Article
Text
id pubmed-3897425
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-38974252014-01-24 PSCC: Sensitive and Reliable Population-Scale Copy Number Variation Detection Method Based on Low Coverage Sequencing Li, Xuchao Chen, Shengpei Xie, Weiwei Vogel, Ida Choy, Kwong Wai Chen, Fang Christensen, Rikke Zhang, Chunlei Ge, Huijuan Jiang, Haojun Yu, Chang Huang, Fang Wang, Wei Jiang, Hui Zhang, Xiuqing PLoS One Research Article BACKGROUND: Copy number variations (CNVs) represent an important type of genetic variation that deeply impact phenotypic polymorphisms and human diseases. The advent of high-throughput sequencing technologies provides an opportunity to revolutionize the discovery of CNVs and to explore their relationship with diseases. However, most of the existing methods depend on sequencing depth and show instability with low sequence coverage. In this study, using low coverage whole-genome sequencing (LCS) we have developed an effective population-scale CNV calling (PSCC) method. METHODOLOGY/PRINCIPAL FINDINGS: In our novel method, two-step correction was used to remove biases caused by local GC content and complex genomic characteristics. We chose a binary segmentation method to locate CNV segments and designed combined statistics tests to ensure the stable performance of the false positive control. The simulation data showed that our PSCC method could achieve 99.7%/100% and 98.6%/100% sensitivity and specificity for over 300 kb CNV calling in the condition of LCS (∼2×) and ultra LCS (∼0.2×), respectively. Finally, we applied this novel method to analyze 34 clinical samples with an average of 2× LCS. In the final results, all the 31 pathogenic CNVs identified by aCGH were successfully detected. In addition, the performance comparison revealed that our method had significant advantages over existing methods using ultra LCS. CONCLUSIONS/SIGNIFICANCE: Our study showed that PSCC can sensitively and reliably detect CNVs using low coverage or even ultra-low coverage data through population-scale sequencing. Public Library of Science 2014-01-21 /pmc/articles/PMC3897425/ /pubmed/24465483 http://dx.doi.org/10.1371/journal.pone.0085096 Text en © 2014 Li et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Li, Xuchao
Chen, Shengpei
Xie, Weiwei
Vogel, Ida
Choy, Kwong Wai
Chen, Fang
Christensen, Rikke
Zhang, Chunlei
Ge, Huijuan
Jiang, Haojun
Yu, Chang
Huang, Fang
Wang, Wei
Jiang, Hui
Zhang, Xiuqing
PSCC: Sensitive and Reliable Population-Scale Copy Number Variation Detection Method Based on Low Coverage Sequencing
title PSCC: Sensitive and Reliable Population-Scale Copy Number Variation Detection Method Based on Low Coverage Sequencing
title_full PSCC: Sensitive and Reliable Population-Scale Copy Number Variation Detection Method Based on Low Coverage Sequencing
title_fullStr PSCC: Sensitive and Reliable Population-Scale Copy Number Variation Detection Method Based on Low Coverage Sequencing
title_full_unstemmed PSCC: Sensitive and Reliable Population-Scale Copy Number Variation Detection Method Based on Low Coverage Sequencing
title_short PSCC: Sensitive and Reliable Population-Scale Copy Number Variation Detection Method Based on Low Coverage Sequencing
title_sort pscc: sensitive and reliable population-scale copy number variation detection method based on low coverage sequencing
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3897425/
https://www.ncbi.nlm.nih.gov/pubmed/24465483
http://dx.doi.org/10.1371/journal.pone.0085096
work_keys_str_mv AT lixuchao psccsensitiveandreliablepopulationscalecopynumbervariationdetectionmethodbasedonlowcoveragesequencing
AT chenshengpei psccsensitiveandreliablepopulationscalecopynumbervariationdetectionmethodbasedonlowcoveragesequencing
AT xieweiwei psccsensitiveandreliablepopulationscalecopynumbervariationdetectionmethodbasedonlowcoveragesequencing
AT vogelida psccsensitiveandreliablepopulationscalecopynumbervariationdetectionmethodbasedonlowcoveragesequencing
AT choykwongwai psccsensitiveandreliablepopulationscalecopynumbervariationdetectionmethodbasedonlowcoveragesequencing
AT chenfang psccsensitiveandreliablepopulationscalecopynumbervariationdetectionmethodbasedonlowcoveragesequencing
AT christensenrikke psccsensitiveandreliablepopulationscalecopynumbervariationdetectionmethodbasedonlowcoveragesequencing
AT zhangchunlei psccsensitiveandreliablepopulationscalecopynumbervariationdetectionmethodbasedonlowcoveragesequencing
AT gehuijuan psccsensitiveandreliablepopulationscalecopynumbervariationdetectionmethodbasedonlowcoveragesequencing
AT jianghaojun psccsensitiveandreliablepopulationscalecopynumbervariationdetectionmethodbasedonlowcoveragesequencing
AT yuchang psccsensitiveandreliablepopulationscalecopynumbervariationdetectionmethodbasedonlowcoveragesequencing
AT huangfang psccsensitiveandreliablepopulationscalecopynumbervariationdetectionmethodbasedonlowcoveragesequencing
AT wangwei psccsensitiveandreliablepopulationscalecopynumbervariationdetectionmethodbasedonlowcoveragesequencing
AT jianghui psccsensitiveandreliablepopulationscalecopynumbervariationdetectionmethodbasedonlowcoveragesequencing
AT zhangxiuqing psccsensitiveandreliablepopulationscalecopynumbervariationdetectionmethodbasedonlowcoveragesequencing