Cargando…
A Cluster-Based Approach for the Discovery of Copy Number Variations From Next-Generation Sequencing Data
The next-generation sequencing technology offers a wealth of data resources for the detection of copy number variations (CNVs) at a high resolution. However, it is still challenging to correctly detect CNVs of different lengths. It is necessary to develop new CNV detection tools to meet this demand....
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8273656/ https://www.ncbi.nlm.nih.gov/pubmed/34262604 http://dx.doi.org/10.3389/fgene.2021.699510 |
_version_ | 1783721414099992576 |
---|---|
author | Liu, Guojun Zhang, Junying |
author_facet | Liu, Guojun Zhang, Junying |
author_sort | Liu, Guojun |
collection | PubMed |
description | The next-generation sequencing technology offers a wealth of data resources for the detection of copy number variations (CNVs) at a high resolution. However, it is still challenging to correctly detect CNVs of different lengths. It is necessary to develop new CNV detection tools to meet this demand. In this work, we propose a new CNV detection method, called CBCNV, for the detection of CNVs of different lengths from whole genome sequencing data. CBCNV uses a clustering algorithm to divide the read depth segment profile, and assigns an abnormal score to each read depth segment. Based on the abnormal score profile, Tukey’s fences method is adopted in CBCNV to forecast CNVs. The performance of the proposed method is evaluated on simulated data sets, and is compared with those of several existing methods. The experimental results prove that the performance of CBCNV is better than those of several existing methods. The proposed method is further tested and verified on real data sets, and the experimental results are found to be consistent with the simulation results. Therefore, the proposed method can be expected to become a routine tool in the analysis of CNVs from tumor-normal matched samples. |
format | Online Article Text |
id | pubmed-8273656 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-82736562021-07-13 A Cluster-Based Approach for the Discovery of Copy Number Variations From Next-Generation Sequencing Data Liu, Guojun Zhang, Junying Front Genet Genetics The next-generation sequencing technology offers a wealth of data resources for the detection of copy number variations (CNVs) at a high resolution. However, it is still challenging to correctly detect CNVs of different lengths. It is necessary to develop new CNV detection tools to meet this demand. In this work, we propose a new CNV detection method, called CBCNV, for the detection of CNVs of different lengths from whole genome sequencing data. CBCNV uses a clustering algorithm to divide the read depth segment profile, and assigns an abnormal score to each read depth segment. Based on the abnormal score profile, Tukey’s fences method is adopted in CBCNV to forecast CNVs. The performance of the proposed method is evaluated on simulated data sets, and is compared with those of several existing methods. The experimental results prove that the performance of CBCNV is better than those of several existing methods. The proposed method is further tested and verified on real data sets, and the experimental results are found to be consistent with the simulation results. Therefore, the proposed method can be expected to become a routine tool in the analysis of CNVs from tumor-normal matched samples. Frontiers Media S.A. 2021-06-28 /pmc/articles/PMC8273656/ /pubmed/34262604 http://dx.doi.org/10.3389/fgene.2021.699510 Text en Copyright © 2021 Liu and Zhang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Liu, Guojun Zhang, Junying A Cluster-Based Approach for the Discovery of Copy Number Variations From Next-Generation Sequencing Data |
title | A Cluster-Based Approach for the Discovery of Copy Number Variations From Next-Generation Sequencing Data |
title_full | A Cluster-Based Approach for the Discovery of Copy Number Variations From Next-Generation Sequencing Data |
title_fullStr | A Cluster-Based Approach for the Discovery of Copy Number Variations From Next-Generation Sequencing Data |
title_full_unstemmed | A Cluster-Based Approach for the Discovery of Copy Number Variations From Next-Generation Sequencing Data |
title_short | A Cluster-Based Approach for the Discovery of Copy Number Variations From Next-Generation Sequencing Data |
title_sort | cluster-based approach for the discovery of copy number variations from next-generation sequencing data |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8273656/ https://www.ncbi.nlm.nih.gov/pubmed/34262604 http://dx.doi.org/10.3389/fgene.2021.699510 |
work_keys_str_mv | AT liuguojun aclusterbasedapproachforthediscoveryofcopynumbervariationsfromnextgenerationsequencingdata AT zhangjunying aclusterbasedapproachforthediscoveryofcopynumbervariationsfromnextgenerationsequencingdata AT liuguojun clusterbasedapproachforthediscoveryofcopynumbervariationsfromnextgenerationsequencingdata AT zhangjunying clusterbasedapproachforthediscoveryofcopynumbervariationsfromnextgenerationsequencingdata |