Cargando…

CIRCNV: Detection of CNVs Based on a Circular Profile of Read Depth from Sequencing Data

SIMPLE SUMMARY: In this study, we propose a copy number variation (CNV) detection method called CIRCNV, which is based on a circular profile of the read depth from sequencing data. The proposed method is an extended version of our previously developed method CNV-LOF. The main difference of CIRCNV fr...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Hai-Yong, Li, Qi, Tian, Ye, Chen, Yue-Hui, Alvi, Haque A. K., Yuan, Xi-Guo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8301091/
https://www.ncbi.nlm.nih.gov/pubmed/34202028
http://dx.doi.org/10.3390/biology10070584
_version_ 1783726591998689280
author Zhao, Hai-Yong
Li, Qi
Tian, Ye
Chen, Yue-Hui
Alvi, Haque A. K.
Yuan, Xi-Guo
author_facet Zhao, Hai-Yong
Li, Qi
Tian, Ye
Chen, Yue-Hui
Alvi, Haque A. K.
Yuan, Xi-Guo
author_sort Zhao, Hai-Yong
collection PubMed
description SIMPLE SUMMARY: In this study, we propose a copy number variation (CNV) detection method called CIRCNV, which is based on a circular profile of the read depth from sequencing data. The proposed method is an extended version of our previously developed method CNV-LOF. The main difference of CIRCNV from CNV-LOF lies in its two new features: (1) it transfers the read depth profile from a line shape to a circular shape via a polar coordinate transformation to generate a meaningful two-dimensional dataset for CNV analysis and promote fairness between the ends and middle part of the genome, and (2) it performs two rounds of CNV declaration via estimating tumor purity and recovering the truth circular RD profile. We test and evaluate the performance of CIRCNV via conducting simulation studies and real sequencing tumor sample applications. The experimental results show that CIRCNV outperforms peer methods with respect to sensitivity, precision, and the F1-score. The experiments prove that the proposed method is a reliable and effective tool in the field of variation analysis of tumor genomes. ABSTRACT: Copy number variation (CNV) is a common type of structural variation in the human genome. Accurate detection of CNVs from tumor genomes can provide crucial information for the study of tumor genesis and cancer precision diagnosis. However, the contamination of normal genomes in tumor genomes and the crude profiles of the read depth make such a task difficult. In this paper, we propose an alternative approach, called CIRCNV, for the detection of CNVs from sequencing data. CIRCNV is an extension of our previously developed method CNV-LOF, which uses local outlier factors to predict CNVs. Comparatively, CIRCNV can be performed on individual tumor samples and has the following two new features: (1) it transfers the read depth profile from a line shape to a circular shape via a polar coordinate transformation, in order to improve the efficiency of the read depth (RD) profile for the detection of CNVs; and (2) it performs a second round of CNV declaration based on the truth circular RD profile, which is recovered by estimating tumor purity. We test and validate the performance of CIRCNV based on simulation and real sequencing data and perform comparisons with several peer methods. The results demonstrate that CIRCNV can obtain superior performance in terms of sensitivity and precision. We expect that our proposed method will be a supplement to existing methods and become a routine tool in the field of variation analysis of tumor genomes.
format Online
Article
Text
id pubmed-8301091
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-83010912021-07-24 CIRCNV: Detection of CNVs Based on a Circular Profile of Read Depth from Sequencing Data Zhao, Hai-Yong Li, Qi Tian, Ye Chen, Yue-Hui Alvi, Haque A. K. Yuan, Xi-Guo Biology (Basel) Article SIMPLE SUMMARY: In this study, we propose a copy number variation (CNV) detection method called CIRCNV, which is based on a circular profile of the read depth from sequencing data. The proposed method is an extended version of our previously developed method CNV-LOF. The main difference of CIRCNV from CNV-LOF lies in its two new features: (1) it transfers the read depth profile from a line shape to a circular shape via a polar coordinate transformation to generate a meaningful two-dimensional dataset for CNV analysis and promote fairness between the ends and middle part of the genome, and (2) it performs two rounds of CNV declaration via estimating tumor purity and recovering the truth circular RD profile. We test and evaluate the performance of CIRCNV via conducting simulation studies and real sequencing tumor sample applications. The experimental results show that CIRCNV outperforms peer methods with respect to sensitivity, precision, and the F1-score. The experiments prove that the proposed method is a reliable and effective tool in the field of variation analysis of tumor genomes. ABSTRACT: Copy number variation (CNV) is a common type of structural variation in the human genome. Accurate detection of CNVs from tumor genomes can provide crucial information for the study of tumor genesis and cancer precision diagnosis. However, the contamination of normal genomes in tumor genomes and the crude profiles of the read depth make such a task difficult. In this paper, we propose an alternative approach, called CIRCNV, for the detection of CNVs from sequencing data. CIRCNV is an extension of our previously developed method CNV-LOF, which uses local outlier factors to predict CNVs. Comparatively, CIRCNV can be performed on individual tumor samples and has the following two new features: (1) it transfers the read depth profile from a line shape to a circular shape via a polar coordinate transformation, in order to improve the efficiency of the read depth (RD) profile for the detection of CNVs; and (2) it performs a second round of CNV declaration based on the truth circular RD profile, which is recovered by estimating tumor purity. We test and validate the performance of CIRCNV based on simulation and real sequencing data and perform comparisons with several peer methods. The results demonstrate that CIRCNV can obtain superior performance in terms of sensitivity and precision. We expect that our proposed method will be a supplement to existing methods and become a routine tool in the field of variation analysis of tumor genomes. MDPI 2021-06-25 /pmc/articles/PMC8301091/ /pubmed/34202028 http://dx.doi.org/10.3390/biology10070584 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zhao, Hai-Yong
Li, Qi
Tian, Ye
Chen, Yue-Hui
Alvi, Haque A. K.
Yuan, Xi-Guo
CIRCNV: Detection of CNVs Based on a Circular Profile of Read Depth from Sequencing Data
title CIRCNV: Detection of CNVs Based on a Circular Profile of Read Depth from Sequencing Data
title_full CIRCNV: Detection of CNVs Based on a Circular Profile of Read Depth from Sequencing Data
title_fullStr CIRCNV: Detection of CNVs Based on a Circular Profile of Read Depth from Sequencing Data
title_full_unstemmed CIRCNV: Detection of CNVs Based on a Circular Profile of Read Depth from Sequencing Data
title_short CIRCNV: Detection of CNVs Based on a Circular Profile of Read Depth from Sequencing Data
title_sort circnv: detection of cnvs based on a circular profile of read depth from sequencing data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8301091/
https://www.ncbi.nlm.nih.gov/pubmed/34202028
http://dx.doi.org/10.3390/biology10070584
work_keys_str_mv AT zhaohaiyong circnvdetectionofcnvsbasedonacircularprofileofreaddepthfromsequencingdata
AT liqi circnvdetectionofcnvsbasedonacircularprofileofreaddepthfromsequencingdata
AT tianye circnvdetectionofcnvsbasedonacircularprofileofreaddepthfromsequencingdata
AT chenyuehui circnvdetectionofcnvsbasedonacircularprofileofreaddepthfromsequencingdata
AT alvihaqueak circnvdetectionofcnvsbasedonacircularprofileofreaddepthfromsequencingdata
AT yuanxiguo circnvdetectionofcnvsbasedonacircularprofileofreaddepthfromsequencingdata