Cargando…

Psi-Caller: A Lightweight Short Read-Based Variant Caller With High Speed and Accuracy

With the rapid development of short-read sequencing technologies, many population-scale resequencing studies have been carried out to study the associations between human genome variants and various phenotypes in recent years. Variant calling is one of the core bioinformatics tasks in such studies t...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Yadong, Jiang, Tao, Gao, Yan, Liu, Bo, Zang, Tianyi, Wang, Yadong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8414796/
https://www.ncbi.nlm.nih.gov/pubmed/34485311
http://dx.doi.org/10.3389/fcell.2021.731424
_version_ 1783747848665300992
author Liu, Yadong
Jiang, Tao
Gao, Yan
Liu, Bo
Zang, Tianyi
Wang, Yadong
author_facet Liu, Yadong
Jiang, Tao
Gao, Yan
Liu, Bo
Zang, Tianyi
Wang, Yadong
author_sort Liu, Yadong
collection PubMed
description With the rapid development of short-read sequencing technologies, many population-scale resequencing studies have been carried out to study the associations between human genome variants and various phenotypes in recent years. Variant calling is one of the core bioinformatics tasks in such studies to comprehensively discover genomic variants in sequenced samples. Many efforts have been made to develop short read-based variant calling approaches; however, state-of-the-art tools are still computationally expensive. Meanwhile, cutting-edge genomics studies also have higher requirements on the yields of variant calling. Herein, we propose Partial-Order Alignment-based single nucleotide polymorphism (SNV) and Indel caller (Psi-caller), a lightweight variant calling algorithm that simultaneously achieves high performance and yield. Mainly, Psi-caller recognizes and divides the candidate variant site into three categories according to the complexity and location of the signatures and employs various methods including binomial model, partial-order alignment, and de Bruijn graph-based local assembly to handle various categories of candidate variant sites to call and genotype SNVs/Indels, respectively. Benchmarks on simulated and real short-read sequencing data sets demonstrate that Psi-caller is times faster than state-of-the-art tools with higher or equal sensitivity and accuracy. It has the potential to well handle large-scale data sets in cutting-edge genomics studies.
format Online
Article
Text
id pubmed-8414796
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-84147962021-09-04 Psi-Caller: A Lightweight Short Read-Based Variant Caller With High Speed and Accuracy Liu, Yadong Jiang, Tao Gao, Yan Liu, Bo Zang, Tianyi Wang, Yadong Front Cell Dev Biol Cell and Developmental Biology With the rapid development of short-read sequencing technologies, many population-scale resequencing studies have been carried out to study the associations between human genome variants and various phenotypes in recent years. Variant calling is one of the core bioinformatics tasks in such studies to comprehensively discover genomic variants in sequenced samples. Many efforts have been made to develop short read-based variant calling approaches; however, state-of-the-art tools are still computationally expensive. Meanwhile, cutting-edge genomics studies also have higher requirements on the yields of variant calling. Herein, we propose Partial-Order Alignment-based single nucleotide polymorphism (SNV) and Indel caller (Psi-caller), a lightweight variant calling algorithm that simultaneously achieves high performance and yield. Mainly, Psi-caller recognizes and divides the candidate variant site into three categories according to the complexity and location of the signatures and employs various methods including binomial model, partial-order alignment, and de Bruijn graph-based local assembly to handle various categories of candidate variant sites to call and genotype SNVs/Indels, respectively. Benchmarks on simulated and real short-read sequencing data sets demonstrate that Psi-caller is times faster than state-of-the-art tools with higher or equal sensitivity and accuracy. It has the potential to well handle large-scale data sets in cutting-edge genomics studies. Frontiers Media S.A. 2021-08-13 /pmc/articles/PMC8414796/ /pubmed/34485311 http://dx.doi.org/10.3389/fcell.2021.731424 Text en Copyright © 2021 Liu, Jiang, Gao, Liu, Zang and Wang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Cell and Developmental Biology
Liu, Yadong
Jiang, Tao
Gao, Yan
Liu, Bo
Zang, Tianyi
Wang, Yadong
Psi-Caller: A Lightweight Short Read-Based Variant Caller With High Speed and Accuracy
title Psi-Caller: A Lightweight Short Read-Based Variant Caller With High Speed and Accuracy
title_full Psi-Caller: A Lightweight Short Read-Based Variant Caller With High Speed and Accuracy
title_fullStr Psi-Caller: A Lightweight Short Read-Based Variant Caller With High Speed and Accuracy
title_full_unstemmed Psi-Caller: A Lightweight Short Read-Based Variant Caller With High Speed and Accuracy
title_short Psi-Caller: A Lightweight Short Read-Based Variant Caller With High Speed and Accuracy
title_sort psi-caller: a lightweight short read-based variant caller with high speed and accuracy
topic Cell and Developmental Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8414796/
https://www.ncbi.nlm.nih.gov/pubmed/34485311
http://dx.doi.org/10.3389/fcell.2021.731424
work_keys_str_mv AT liuyadong psicalleralightweightshortreadbasedvariantcallerwithhighspeedandaccuracy
AT jiangtao psicalleralightweightshortreadbasedvariantcallerwithhighspeedandaccuracy
AT gaoyan psicalleralightweightshortreadbasedvariantcallerwithhighspeedandaccuracy
AT liubo psicalleralightweightshortreadbasedvariantcallerwithhighspeedandaccuracy
AT zangtianyi psicalleralightweightshortreadbasedvariantcallerwithhighspeedandaccuracy
AT wangyadong psicalleralightweightshortreadbasedvariantcallerwithhighspeedandaccuracy