Cargando…
Detection of copy number variations based on a local distance using next-generation sequencing data
As one of the main types of structural variation in the human genome, copy number variation (CNV) plays an important role in the occurrence and development of human cancers. Next-generation sequencing (NGS) technology can provide base-level resolution, which provides favorable conditions for the acc...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10556732/ https://www.ncbi.nlm.nih.gov/pubmed/37811148 http://dx.doi.org/10.3389/fgene.2023.1147761 |
_version_ | 1785116930229665792 |
---|---|
author | Liu, Guojun Yang, Hongzhi He, Zongzhen |
author_facet | Liu, Guojun Yang, Hongzhi He, Zongzhen |
author_sort | Liu, Guojun |
collection | PubMed |
description | As one of the main types of structural variation in the human genome, copy number variation (CNV) plays an important role in the occurrence and development of human cancers. Next-generation sequencing (NGS) technology can provide base-level resolution, which provides favorable conditions for the accurate detection of CNVs. However, it is still a very challenging task to accurately detect CNVs from cancer samples with different purity and low sequencing coverage. Local distance-based CNV detection (LDCNV), an innovative computational approach to predict CNVs using NGS data, is proposed in this work. LDCNV calculates the average distance between each read depth (RD) and its k nearest neighbors (KNNs) to define the distance of KNNs of each RD, and the average distance between the KNNs for each RD to define their internal distance. Based on the above definitions, a local distance score is constructed using the ratio between the distance of KNNs and the internal distance of KNNs for each RD. The local distance scores are used to fit a normal distribution to evaluate the significance level of each RDS, and then use the hypothesis test method to predict the CNVs. The performance of the proposed method is verified with simulated and real data and compared with several popular methods. The experimental results show that the proposed method is superior to various other techniques. Therefore, the proposed method can be helpful for cancer diagnosis and targeted drug development. |
format | Online Article Text |
id | pubmed-10556732 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-105567322023-10-07 Detection of copy number variations based on a local distance using next-generation sequencing data Liu, Guojun Yang, Hongzhi He, Zongzhen Front Genet Genetics As one of the main types of structural variation in the human genome, copy number variation (CNV) plays an important role in the occurrence and development of human cancers. Next-generation sequencing (NGS) technology can provide base-level resolution, which provides favorable conditions for the accurate detection of CNVs. However, it is still a very challenging task to accurately detect CNVs from cancer samples with different purity and low sequencing coverage. Local distance-based CNV detection (LDCNV), an innovative computational approach to predict CNVs using NGS data, is proposed in this work. LDCNV calculates the average distance between each read depth (RD) and its k nearest neighbors (KNNs) to define the distance of KNNs of each RD, and the average distance between the KNNs for each RD to define their internal distance. Based on the above definitions, a local distance score is constructed using the ratio between the distance of KNNs and the internal distance of KNNs for each RD. The local distance scores are used to fit a normal distribution to evaluate the significance level of each RDS, and then use the hypothesis test method to predict the CNVs. The performance of the proposed method is verified with simulated and real data and compared with several popular methods. The experimental results show that the proposed method is superior to various other techniques. Therefore, the proposed method can be helpful for cancer diagnosis and targeted drug development. Frontiers Media S.A. 2023-09-22 /pmc/articles/PMC10556732/ /pubmed/37811148 http://dx.doi.org/10.3389/fgene.2023.1147761 Text en Copyright © 2023 Liu, Yang and He. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Liu, Guojun Yang, Hongzhi He, Zongzhen Detection of copy number variations based on a local distance using next-generation sequencing data |
title | Detection of copy number variations based on a local distance using next-generation sequencing data |
title_full | Detection of copy number variations based on a local distance using next-generation sequencing data |
title_fullStr | Detection of copy number variations based on a local distance using next-generation sequencing data |
title_full_unstemmed | Detection of copy number variations based on a local distance using next-generation sequencing data |
title_short | Detection of copy number variations based on a local distance using next-generation sequencing data |
title_sort | detection of copy number variations based on a local distance using next-generation sequencing data |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10556732/ https://www.ncbi.nlm.nih.gov/pubmed/37811148 http://dx.doi.org/10.3389/fgene.2023.1147761 |
work_keys_str_mv | AT liuguojun detectionofcopynumbervariationsbasedonalocaldistanceusingnextgenerationsequencingdata AT yanghongzhi detectionofcopynumbervariationsbasedonalocaldistanceusingnextgenerationsequencingdata AT hezongzhen detectionofcopynumbervariationsbasedonalocaldistanceusingnextgenerationsequencingdata |