Cargando…

Detecting and phasing minor single-nucleotide variants from long-read sequencing data

Cellular genetic heterogeneity is common in many biological conditions including cancer, microbiome, and co-infection of multiple pathogens. Detecting and phasing minor variants play an instrumental role in deciphering cellular genetic heterogeneity, but they are still difficult tasks because of tec...

Descripción completa

Detalles Bibliográficos
Autores principales: Feng, Zhixing, Clemente, Jose C., Wong, Brandon, Schadt, Eric E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8144375/
https://www.ncbi.nlm.nih.gov/pubmed/34031367
http://dx.doi.org/10.1038/s41467-021-23289-4
_version_ 1783696943330885632
author Feng, Zhixing
Clemente, Jose C.
Wong, Brandon
Schadt, Eric E.
author_facet Feng, Zhixing
Clemente, Jose C.
Wong, Brandon
Schadt, Eric E.
author_sort Feng, Zhixing
collection PubMed
description Cellular genetic heterogeneity is common in many biological conditions including cancer, microbiome, and co-infection of multiple pathogens. Detecting and phasing minor variants play an instrumental role in deciphering cellular genetic heterogeneity, but they are still difficult tasks because of technological limitations. Recently, long-read sequencing technologies, including those by Pacific Biosciences and Oxford Nanopore, provide an opportunity to tackle these challenges. However, high error rates make it difficult to take full advantage of these technologies. To fill this gap, we introduce iGDA, an open-source tool that can accurately detect and phase minor single-nucleotide variants (SNVs), whose frequencies are as low as 0.2%, from raw long-read sequencing data. We also demonstrate that iGDA can accurately reconstruct haplotypes in closely related strains of the same species (divergence ≥0.011%) from long-read metagenomic data.
format Online
Article
Text
id pubmed-8144375
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-81443752021-06-07 Detecting and phasing minor single-nucleotide variants from long-read sequencing data Feng, Zhixing Clemente, Jose C. Wong, Brandon Schadt, Eric E. Nat Commun Article Cellular genetic heterogeneity is common in many biological conditions including cancer, microbiome, and co-infection of multiple pathogens. Detecting and phasing minor variants play an instrumental role in deciphering cellular genetic heterogeneity, but they are still difficult tasks because of technological limitations. Recently, long-read sequencing technologies, including those by Pacific Biosciences and Oxford Nanopore, provide an opportunity to tackle these challenges. However, high error rates make it difficult to take full advantage of these technologies. To fill this gap, we introduce iGDA, an open-source tool that can accurately detect and phase minor single-nucleotide variants (SNVs), whose frequencies are as low as 0.2%, from raw long-read sequencing data. We also demonstrate that iGDA can accurately reconstruct haplotypes in closely related strains of the same species (divergence ≥0.011%) from long-read metagenomic data. Nature Publishing Group UK 2021-05-24 /pmc/articles/PMC8144375/ /pubmed/34031367 http://dx.doi.org/10.1038/s41467-021-23289-4 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Feng, Zhixing
Clemente, Jose C.
Wong, Brandon
Schadt, Eric E.
Detecting and phasing minor single-nucleotide variants from long-read sequencing data
title Detecting and phasing minor single-nucleotide variants from long-read sequencing data
title_full Detecting and phasing minor single-nucleotide variants from long-read sequencing data
title_fullStr Detecting and phasing minor single-nucleotide variants from long-read sequencing data
title_full_unstemmed Detecting and phasing minor single-nucleotide variants from long-read sequencing data
title_short Detecting and phasing minor single-nucleotide variants from long-read sequencing data
title_sort detecting and phasing minor single-nucleotide variants from long-read sequencing data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8144375/
https://www.ncbi.nlm.nih.gov/pubmed/34031367
http://dx.doi.org/10.1038/s41467-021-23289-4
work_keys_str_mv AT fengzhixing detectingandphasingminorsinglenucleotidevariantsfromlongreadsequencingdata
AT clementejosec detectingandphasingminorsinglenucleotidevariantsfromlongreadsequencingdata
AT wongbrandon detectingandphasingminorsinglenucleotidevariantsfromlongreadsequencingdata
AT schadterice detectingandphasingminorsinglenucleotidevariantsfromlongreadsequencingdata