Cargando…
PSCC: Sensitive and Reliable Population-Scale Copy Number Variation Detection Method Based on Low Coverage Sequencing
BACKGROUND: Copy number variations (CNVs) represent an important type of genetic variation that deeply impact phenotypic polymorphisms and human diseases. The advent of high-throughput sequencing technologies provides an opportunity to revolutionize the discovery of CNVs and to explore their relatio...
Autores principales: | , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3897425/ https://www.ncbi.nlm.nih.gov/pubmed/24465483 http://dx.doi.org/10.1371/journal.pone.0085096 |
_version_ | 1782300231020314624 |
---|---|
author | Li, Xuchao Chen, Shengpei Xie, Weiwei Vogel, Ida Choy, Kwong Wai Chen, Fang Christensen, Rikke Zhang, Chunlei Ge, Huijuan Jiang, Haojun Yu, Chang Huang, Fang Wang, Wei Jiang, Hui Zhang, Xiuqing |
author_facet | Li, Xuchao Chen, Shengpei Xie, Weiwei Vogel, Ida Choy, Kwong Wai Chen, Fang Christensen, Rikke Zhang, Chunlei Ge, Huijuan Jiang, Haojun Yu, Chang Huang, Fang Wang, Wei Jiang, Hui Zhang, Xiuqing |
author_sort | Li, Xuchao |
collection | PubMed |
description | BACKGROUND: Copy number variations (CNVs) represent an important type of genetic variation that deeply impact phenotypic polymorphisms and human diseases. The advent of high-throughput sequencing technologies provides an opportunity to revolutionize the discovery of CNVs and to explore their relationship with diseases. However, most of the existing methods depend on sequencing depth and show instability with low sequence coverage. In this study, using low coverage whole-genome sequencing (LCS) we have developed an effective population-scale CNV calling (PSCC) method. METHODOLOGY/PRINCIPAL FINDINGS: In our novel method, two-step correction was used to remove biases caused by local GC content and complex genomic characteristics. We chose a binary segmentation method to locate CNV segments and designed combined statistics tests to ensure the stable performance of the false positive control. The simulation data showed that our PSCC method could achieve 99.7%/100% and 98.6%/100% sensitivity and specificity for over 300 kb CNV calling in the condition of LCS (∼2×) and ultra LCS (∼0.2×), respectively. Finally, we applied this novel method to analyze 34 clinical samples with an average of 2× LCS. In the final results, all the 31 pathogenic CNVs identified by aCGH were successfully detected. In addition, the performance comparison revealed that our method had significant advantages over existing methods using ultra LCS. CONCLUSIONS/SIGNIFICANCE: Our study showed that PSCC can sensitively and reliably detect CNVs using low coverage or even ultra-low coverage data through population-scale sequencing. |
format | Online Article Text |
id | pubmed-3897425 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-38974252014-01-24 PSCC: Sensitive and Reliable Population-Scale Copy Number Variation Detection Method Based on Low Coverage Sequencing Li, Xuchao Chen, Shengpei Xie, Weiwei Vogel, Ida Choy, Kwong Wai Chen, Fang Christensen, Rikke Zhang, Chunlei Ge, Huijuan Jiang, Haojun Yu, Chang Huang, Fang Wang, Wei Jiang, Hui Zhang, Xiuqing PLoS One Research Article BACKGROUND: Copy number variations (CNVs) represent an important type of genetic variation that deeply impact phenotypic polymorphisms and human diseases. The advent of high-throughput sequencing technologies provides an opportunity to revolutionize the discovery of CNVs and to explore their relationship with diseases. However, most of the existing methods depend on sequencing depth and show instability with low sequence coverage. In this study, using low coverage whole-genome sequencing (LCS) we have developed an effective population-scale CNV calling (PSCC) method. METHODOLOGY/PRINCIPAL FINDINGS: In our novel method, two-step correction was used to remove biases caused by local GC content and complex genomic characteristics. We chose a binary segmentation method to locate CNV segments and designed combined statistics tests to ensure the stable performance of the false positive control. The simulation data showed that our PSCC method could achieve 99.7%/100% and 98.6%/100% sensitivity and specificity for over 300 kb CNV calling in the condition of LCS (∼2×) and ultra LCS (∼0.2×), respectively. Finally, we applied this novel method to analyze 34 clinical samples with an average of 2× LCS. In the final results, all the 31 pathogenic CNVs identified by aCGH were successfully detected. In addition, the performance comparison revealed that our method had significant advantages over existing methods using ultra LCS. CONCLUSIONS/SIGNIFICANCE: Our study showed that PSCC can sensitively and reliably detect CNVs using low coverage or even ultra-low coverage data through population-scale sequencing. Public Library of Science 2014-01-21 /pmc/articles/PMC3897425/ /pubmed/24465483 http://dx.doi.org/10.1371/journal.pone.0085096 Text en © 2014 Li et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Li, Xuchao Chen, Shengpei Xie, Weiwei Vogel, Ida Choy, Kwong Wai Chen, Fang Christensen, Rikke Zhang, Chunlei Ge, Huijuan Jiang, Haojun Yu, Chang Huang, Fang Wang, Wei Jiang, Hui Zhang, Xiuqing PSCC: Sensitive and Reliable Population-Scale Copy Number Variation Detection Method Based on Low Coverage Sequencing |
title | PSCC: Sensitive and Reliable Population-Scale Copy Number Variation Detection Method Based on Low Coverage Sequencing |
title_full | PSCC: Sensitive and Reliable Population-Scale Copy Number Variation Detection Method Based on Low Coverage Sequencing |
title_fullStr | PSCC: Sensitive and Reliable Population-Scale Copy Number Variation Detection Method Based on Low Coverage Sequencing |
title_full_unstemmed | PSCC: Sensitive and Reliable Population-Scale Copy Number Variation Detection Method Based on Low Coverage Sequencing |
title_short | PSCC: Sensitive and Reliable Population-Scale Copy Number Variation Detection Method Based on Low Coverage Sequencing |
title_sort | pscc: sensitive and reliable population-scale copy number variation detection method based on low coverage sequencing |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3897425/ https://www.ncbi.nlm.nih.gov/pubmed/24465483 http://dx.doi.org/10.1371/journal.pone.0085096 |
work_keys_str_mv | AT lixuchao psccsensitiveandreliablepopulationscalecopynumbervariationdetectionmethodbasedonlowcoveragesequencing AT chenshengpei psccsensitiveandreliablepopulationscalecopynumbervariationdetectionmethodbasedonlowcoveragesequencing AT xieweiwei psccsensitiveandreliablepopulationscalecopynumbervariationdetectionmethodbasedonlowcoveragesequencing AT vogelida psccsensitiveandreliablepopulationscalecopynumbervariationdetectionmethodbasedonlowcoveragesequencing AT choykwongwai psccsensitiveandreliablepopulationscalecopynumbervariationdetectionmethodbasedonlowcoveragesequencing AT chenfang psccsensitiveandreliablepopulationscalecopynumbervariationdetectionmethodbasedonlowcoveragesequencing AT christensenrikke psccsensitiveandreliablepopulationscalecopynumbervariationdetectionmethodbasedonlowcoveragesequencing AT zhangchunlei psccsensitiveandreliablepopulationscalecopynumbervariationdetectionmethodbasedonlowcoveragesequencing AT gehuijuan psccsensitiveandreliablepopulationscalecopynumbervariationdetectionmethodbasedonlowcoveragesequencing AT jianghaojun psccsensitiveandreliablepopulationscalecopynumbervariationdetectionmethodbasedonlowcoveragesequencing AT yuchang psccsensitiveandreliablepopulationscalecopynumbervariationdetectionmethodbasedonlowcoveragesequencing AT huangfang psccsensitiveandreliablepopulationscalecopynumbervariationdetectionmethodbasedonlowcoveragesequencing AT wangwei psccsensitiveandreliablepopulationscalecopynumbervariationdetectionmethodbasedonlowcoveragesequencing AT jianghui psccsensitiveandreliablepopulationscalecopynumbervariationdetectionmethodbasedonlowcoveragesequencing AT zhangxiuqing psccsensitiveandreliablepopulationscalecopynumbervariationdetectionmethodbasedonlowcoveragesequencing |