Cargando…
Detecting and phasing minor single-nucleotide variants from long-read sequencing data
Cellular genetic heterogeneity is common in many biological conditions including cancer, microbiome, and co-infection of multiple pathogens. Detecting and phasing minor variants play an instrumental role in deciphering cellular genetic heterogeneity, but they are still difficult tasks because of tec...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8144375/ https://www.ncbi.nlm.nih.gov/pubmed/34031367 http://dx.doi.org/10.1038/s41467-021-23289-4 |
_version_ | 1783696943330885632 |
---|---|
author | Feng, Zhixing Clemente, Jose C. Wong, Brandon Schadt, Eric E. |
author_facet | Feng, Zhixing Clemente, Jose C. Wong, Brandon Schadt, Eric E. |
author_sort | Feng, Zhixing |
collection | PubMed |
description | Cellular genetic heterogeneity is common in many biological conditions including cancer, microbiome, and co-infection of multiple pathogens. Detecting and phasing minor variants play an instrumental role in deciphering cellular genetic heterogeneity, but they are still difficult tasks because of technological limitations. Recently, long-read sequencing technologies, including those by Pacific Biosciences and Oxford Nanopore, provide an opportunity to tackle these challenges. However, high error rates make it difficult to take full advantage of these technologies. To fill this gap, we introduce iGDA, an open-source tool that can accurately detect and phase minor single-nucleotide variants (SNVs), whose frequencies are as low as 0.2%, from raw long-read sequencing data. We also demonstrate that iGDA can accurately reconstruct haplotypes in closely related strains of the same species (divergence ≥0.011%) from long-read metagenomic data. |
format | Online Article Text |
id | pubmed-8144375 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-81443752021-06-07 Detecting and phasing minor single-nucleotide variants from long-read sequencing data Feng, Zhixing Clemente, Jose C. Wong, Brandon Schadt, Eric E. Nat Commun Article Cellular genetic heterogeneity is common in many biological conditions including cancer, microbiome, and co-infection of multiple pathogens. Detecting and phasing minor variants play an instrumental role in deciphering cellular genetic heterogeneity, but they are still difficult tasks because of technological limitations. Recently, long-read sequencing technologies, including those by Pacific Biosciences and Oxford Nanopore, provide an opportunity to tackle these challenges. However, high error rates make it difficult to take full advantage of these technologies. To fill this gap, we introduce iGDA, an open-source tool that can accurately detect and phase minor single-nucleotide variants (SNVs), whose frequencies are as low as 0.2%, from raw long-read sequencing data. We also demonstrate that iGDA can accurately reconstruct haplotypes in closely related strains of the same species (divergence ≥0.011%) from long-read metagenomic data. Nature Publishing Group UK 2021-05-24 /pmc/articles/PMC8144375/ /pubmed/34031367 http://dx.doi.org/10.1038/s41467-021-23289-4 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Feng, Zhixing Clemente, Jose C. Wong, Brandon Schadt, Eric E. Detecting and phasing minor single-nucleotide variants from long-read sequencing data |
title | Detecting and phasing minor single-nucleotide variants from long-read sequencing data |
title_full | Detecting and phasing minor single-nucleotide variants from long-read sequencing data |
title_fullStr | Detecting and phasing minor single-nucleotide variants from long-read sequencing data |
title_full_unstemmed | Detecting and phasing minor single-nucleotide variants from long-read sequencing data |
title_short | Detecting and phasing minor single-nucleotide variants from long-read sequencing data |
title_sort | detecting and phasing minor single-nucleotide variants from long-read sequencing data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8144375/ https://www.ncbi.nlm.nih.gov/pubmed/34031367 http://dx.doi.org/10.1038/s41467-021-23289-4 |
work_keys_str_mv | AT fengzhixing detectingandphasingminorsinglenucleotidevariantsfromlongreadsequencingdata AT clementejosec detectingandphasingminorsinglenucleotidevariantsfromlongreadsequencingdata AT wongbrandon detectingandphasingminorsinglenucleotidevariantsfromlongreadsequencingdata AT schadterice detectingandphasingminorsinglenucleotidevariantsfromlongreadsequencingdata |