Cargando…
Psi-Caller: A Lightweight Short Read-Based Variant Caller With High Speed and Accuracy
With the rapid development of short-read sequencing technologies, many population-scale resequencing studies have been carried out to study the associations between human genome variants and various phenotypes in recent years. Variant calling is one of the core bioinformatics tasks in such studies t...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8414796/ https://www.ncbi.nlm.nih.gov/pubmed/34485311 http://dx.doi.org/10.3389/fcell.2021.731424 |
_version_ | 1783747848665300992 |
---|---|
author | Liu, Yadong Jiang, Tao Gao, Yan Liu, Bo Zang, Tianyi Wang, Yadong |
author_facet | Liu, Yadong Jiang, Tao Gao, Yan Liu, Bo Zang, Tianyi Wang, Yadong |
author_sort | Liu, Yadong |
collection | PubMed |
description | With the rapid development of short-read sequencing technologies, many population-scale resequencing studies have been carried out to study the associations between human genome variants and various phenotypes in recent years. Variant calling is one of the core bioinformatics tasks in such studies to comprehensively discover genomic variants in sequenced samples. Many efforts have been made to develop short read-based variant calling approaches; however, state-of-the-art tools are still computationally expensive. Meanwhile, cutting-edge genomics studies also have higher requirements on the yields of variant calling. Herein, we propose Partial-Order Alignment-based single nucleotide polymorphism (SNV) and Indel caller (Psi-caller), a lightweight variant calling algorithm that simultaneously achieves high performance and yield. Mainly, Psi-caller recognizes and divides the candidate variant site into three categories according to the complexity and location of the signatures and employs various methods including binomial model, partial-order alignment, and de Bruijn graph-based local assembly to handle various categories of candidate variant sites to call and genotype SNVs/Indels, respectively. Benchmarks on simulated and real short-read sequencing data sets demonstrate that Psi-caller is times faster than state-of-the-art tools with higher or equal sensitivity and accuracy. It has the potential to well handle large-scale data sets in cutting-edge genomics studies. |
format | Online Article Text |
id | pubmed-8414796 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-84147962021-09-04 Psi-Caller: A Lightweight Short Read-Based Variant Caller With High Speed and Accuracy Liu, Yadong Jiang, Tao Gao, Yan Liu, Bo Zang, Tianyi Wang, Yadong Front Cell Dev Biol Cell and Developmental Biology With the rapid development of short-read sequencing technologies, many population-scale resequencing studies have been carried out to study the associations between human genome variants and various phenotypes in recent years. Variant calling is one of the core bioinformatics tasks in such studies to comprehensively discover genomic variants in sequenced samples. Many efforts have been made to develop short read-based variant calling approaches; however, state-of-the-art tools are still computationally expensive. Meanwhile, cutting-edge genomics studies also have higher requirements on the yields of variant calling. Herein, we propose Partial-Order Alignment-based single nucleotide polymorphism (SNV) and Indel caller (Psi-caller), a lightweight variant calling algorithm that simultaneously achieves high performance and yield. Mainly, Psi-caller recognizes and divides the candidate variant site into three categories according to the complexity and location of the signatures and employs various methods including binomial model, partial-order alignment, and de Bruijn graph-based local assembly to handle various categories of candidate variant sites to call and genotype SNVs/Indels, respectively. Benchmarks on simulated and real short-read sequencing data sets demonstrate that Psi-caller is times faster than state-of-the-art tools with higher or equal sensitivity and accuracy. It has the potential to well handle large-scale data sets in cutting-edge genomics studies. Frontiers Media S.A. 2021-08-13 /pmc/articles/PMC8414796/ /pubmed/34485311 http://dx.doi.org/10.3389/fcell.2021.731424 Text en Copyright © 2021 Liu, Jiang, Gao, Liu, Zang and Wang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Cell and Developmental Biology Liu, Yadong Jiang, Tao Gao, Yan Liu, Bo Zang, Tianyi Wang, Yadong Psi-Caller: A Lightweight Short Read-Based Variant Caller With High Speed and Accuracy |
title | Psi-Caller: A Lightweight Short Read-Based Variant Caller With High Speed and Accuracy |
title_full | Psi-Caller: A Lightweight Short Read-Based Variant Caller With High Speed and Accuracy |
title_fullStr | Psi-Caller: A Lightweight Short Read-Based Variant Caller With High Speed and Accuracy |
title_full_unstemmed | Psi-Caller: A Lightweight Short Read-Based Variant Caller With High Speed and Accuracy |
title_short | Psi-Caller: A Lightweight Short Read-Based Variant Caller With High Speed and Accuracy |
title_sort | psi-caller: a lightweight short read-based variant caller with high speed and accuracy |
topic | Cell and Developmental Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8414796/ https://www.ncbi.nlm.nih.gov/pubmed/34485311 http://dx.doi.org/10.3389/fcell.2021.731424 |
work_keys_str_mv | AT liuyadong psicalleralightweightshortreadbasedvariantcallerwithhighspeedandaccuracy AT jiangtao psicalleralightweightshortreadbasedvariantcallerwithhighspeedandaccuracy AT gaoyan psicalleralightweightshortreadbasedvariantcallerwithhighspeedandaccuracy AT liubo psicalleralightweightshortreadbasedvariantcallerwithhighspeedandaccuracy AT zangtianyi psicalleralightweightshortreadbasedvariantcallerwithhighspeedandaccuracy AT wangyadong psicalleralightweightshortreadbasedvariantcallerwithhighspeedandaccuracy |