Cargando…

Detection of copy number variations based on a local distance using next-generation sequencing data

As one of the main types of structural variation in the human genome, copy number variation (CNV) plays an important role in the occurrence and development of human cancers. Next-generation sequencing (NGS) technology can provide base-level resolution, which provides favorable conditions for the acc...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Guojun, Yang, Hongzhi, He, Zongzhen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10556732/
https://www.ncbi.nlm.nih.gov/pubmed/37811148
http://dx.doi.org/10.3389/fgene.2023.1147761
_version_ 1785116930229665792
author Liu, Guojun
Yang, Hongzhi
He, Zongzhen
author_facet Liu, Guojun
Yang, Hongzhi
He, Zongzhen
author_sort Liu, Guojun
collection PubMed
description As one of the main types of structural variation in the human genome, copy number variation (CNV) plays an important role in the occurrence and development of human cancers. Next-generation sequencing (NGS) technology can provide base-level resolution, which provides favorable conditions for the accurate detection of CNVs. However, it is still a very challenging task to accurately detect CNVs from cancer samples with different purity and low sequencing coverage. Local distance-based CNV detection (LDCNV), an innovative computational approach to predict CNVs using NGS data, is proposed in this work. LDCNV calculates the average distance between each read depth (RD) and its k nearest neighbors (KNNs) to define the distance of KNNs of each RD, and the average distance between the KNNs for each RD to define their internal distance. Based on the above definitions, a local distance score is constructed using the ratio between the distance of KNNs and the internal distance of KNNs for each RD. The local distance scores are used to fit a normal distribution to evaluate the significance level of each RDS, and then use the hypothesis test method to predict the CNVs. The performance of the proposed method is verified with simulated and real data and compared with several popular methods. The experimental results show that the proposed method is superior to various other techniques. Therefore, the proposed method can be helpful for cancer diagnosis and targeted drug development.
format Online
Article
Text
id pubmed-10556732
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-105567322023-10-07 Detection of copy number variations based on a local distance using next-generation sequencing data Liu, Guojun Yang, Hongzhi He, Zongzhen Front Genet Genetics As one of the main types of structural variation in the human genome, copy number variation (CNV) plays an important role in the occurrence and development of human cancers. Next-generation sequencing (NGS) technology can provide base-level resolution, which provides favorable conditions for the accurate detection of CNVs. However, it is still a very challenging task to accurately detect CNVs from cancer samples with different purity and low sequencing coverage. Local distance-based CNV detection (LDCNV), an innovative computational approach to predict CNVs using NGS data, is proposed in this work. LDCNV calculates the average distance between each read depth (RD) and its k nearest neighbors (KNNs) to define the distance of KNNs of each RD, and the average distance between the KNNs for each RD to define their internal distance. Based on the above definitions, a local distance score is constructed using the ratio between the distance of KNNs and the internal distance of KNNs for each RD. The local distance scores are used to fit a normal distribution to evaluate the significance level of each RDS, and then use the hypothesis test method to predict the CNVs. The performance of the proposed method is verified with simulated and real data and compared with several popular methods. The experimental results show that the proposed method is superior to various other techniques. Therefore, the proposed method can be helpful for cancer diagnosis and targeted drug development. Frontiers Media S.A. 2023-09-22 /pmc/articles/PMC10556732/ /pubmed/37811148 http://dx.doi.org/10.3389/fgene.2023.1147761 Text en Copyright © 2023 Liu, Yang and He. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Liu, Guojun
Yang, Hongzhi
He, Zongzhen
Detection of copy number variations based on a local distance using next-generation sequencing data
title Detection of copy number variations based on a local distance using next-generation sequencing data
title_full Detection of copy number variations based on a local distance using next-generation sequencing data
title_fullStr Detection of copy number variations based on a local distance using next-generation sequencing data
title_full_unstemmed Detection of copy number variations based on a local distance using next-generation sequencing data
title_short Detection of copy number variations based on a local distance using next-generation sequencing data
title_sort detection of copy number variations based on a local distance using next-generation sequencing data
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10556732/
https://www.ncbi.nlm.nih.gov/pubmed/37811148
http://dx.doi.org/10.3389/fgene.2023.1147761
work_keys_str_mv AT liuguojun detectionofcopynumbervariationsbasedonalocaldistanceusingnextgenerationsequencingdata
AT yanghongzhi detectionofcopynumbervariationsbasedonalocaldistanceusingnextgenerationsequencingdata
AT hezongzhen detectionofcopynumbervariationsbasedonalocaldistanceusingnextgenerationsequencingdata