Cargando…

A Cluster-Based Approach for the Discovery of Copy Number Variations From Next-Generation Sequencing Data

The next-generation sequencing technology offers a wealth of data resources for the detection of copy number variations (CNVs) at a high resolution. However, it is still challenging to correctly detect CNVs of different lengths. It is necessary to develop new CNV detection tools to meet this demand....

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Guojun, Zhang, Junying
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8273656/
https://www.ncbi.nlm.nih.gov/pubmed/34262604
http://dx.doi.org/10.3389/fgene.2021.699510
_version_ 1783721414099992576
author Liu, Guojun
Zhang, Junying
author_facet Liu, Guojun
Zhang, Junying
author_sort Liu, Guojun
collection PubMed
description The next-generation sequencing technology offers a wealth of data resources for the detection of copy number variations (CNVs) at a high resolution. However, it is still challenging to correctly detect CNVs of different lengths. It is necessary to develop new CNV detection tools to meet this demand. In this work, we propose a new CNV detection method, called CBCNV, for the detection of CNVs of different lengths from whole genome sequencing data. CBCNV uses a clustering algorithm to divide the read depth segment profile, and assigns an abnormal score to each read depth segment. Based on the abnormal score profile, Tukey’s fences method is adopted in CBCNV to forecast CNVs. The performance of the proposed method is evaluated on simulated data sets, and is compared with those of several existing methods. The experimental results prove that the performance of CBCNV is better than those of several existing methods. The proposed method is further tested and verified on real data sets, and the experimental results are found to be consistent with the simulation results. Therefore, the proposed method can be expected to become a routine tool in the analysis of CNVs from tumor-normal matched samples.
format Online
Article
Text
id pubmed-8273656
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-82736562021-07-13 A Cluster-Based Approach for the Discovery of Copy Number Variations From Next-Generation Sequencing Data Liu, Guojun Zhang, Junying Front Genet Genetics The next-generation sequencing technology offers a wealth of data resources for the detection of copy number variations (CNVs) at a high resolution. However, it is still challenging to correctly detect CNVs of different lengths. It is necessary to develop new CNV detection tools to meet this demand. In this work, we propose a new CNV detection method, called CBCNV, for the detection of CNVs of different lengths from whole genome sequencing data. CBCNV uses a clustering algorithm to divide the read depth segment profile, and assigns an abnormal score to each read depth segment. Based on the abnormal score profile, Tukey’s fences method is adopted in CBCNV to forecast CNVs. The performance of the proposed method is evaluated on simulated data sets, and is compared with those of several existing methods. The experimental results prove that the performance of CBCNV is better than those of several existing methods. The proposed method is further tested and verified on real data sets, and the experimental results are found to be consistent with the simulation results. Therefore, the proposed method can be expected to become a routine tool in the analysis of CNVs from tumor-normal matched samples. Frontiers Media S.A. 2021-06-28 /pmc/articles/PMC8273656/ /pubmed/34262604 http://dx.doi.org/10.3389/fgene.2021.699510 Text en Copyright © 2021 Liu and Zhang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Liu, Guojun
Zhang, Junying
A Cluster-Based Approach for the Discovery of Copy Number Variations From Next-Generation Sequencing Data
title A Cluster-Based Approach for the Discovery of Copy Number Variations From Next-Generation Sequencing Data
title_full A Cluster-Based Approach for the Discovery of Copy Number Variations From Next-Generation Sequencing Data
title_fullStr A Cluster-Based Approach for the Discovery of Copy Number Variations From Next-Generation Sequencing Data
title_full_unstemmed A Cluster-Based Approach for the Discovery of Copy Number Variations From Next-Generation Sequencing Data
title_short A Cluster-Based Approach for the Discovery of Copy Number Variations From Next-Generation Sequencing Data
title_sort cluster-based approach for the discovery of copy number variations from next-generation sequencing data
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8273656/
https://www.ncbi.nlm.nih.gov/pubmed/34262604
http://dx.doi.org/10.3389/fgene.2021.699510
work_keys_str_mv AT liuguojun aclusterbasedapproachforthediscoveryofcopynumbervariationsfromnextgenerationsequencingdata
AT zhangjunying aclusterbasedapproachforthediscoveryofcopynumbervariationsfromnextgenerationsequencingdata
AT liuguojun clusterbasedapproachforthediscoveryofcopynumbervariationsfromnextgenerationsequencingdata
AT zhangjunying clusterbasedapproachforthediscoveryofcopynumbervariationsfromnextgenerationsequencingdata