Cargando…

Clair3-trio: high-performance Nanopore long-read variant calling in family trios with trio-to-trio deep neural networks

Accurate identification of genetic variants from family child–mother–father trio sequencing data is important in genomics. However, state-of-the-art approaches treat variant calling from trios as three independent tasks, which limits their calling accuracy for Nanopore long-read sequencing data. For...

Descripción completa

Detalles Bibliográficos
Autores principales: Su, Junhao, Zheng, Zhenxian, Ahmed, Syed Shakeel, Lam, Tak-Wah, Luo, Ruibang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9487642/
https://www.ncbi.nlm.nih.gov/pubmed/35849103
http://dx.doi.org/10.1093/bib/bbac301
_version_ 1784792496564338688
author Su, Junhao
Zheng, Zhenxian
Ahmed, Syed Shakeel
Lam, Tak-Wah
Luo, Ruibang
author_facet Su, Junhao
Zheng, Zhenxian
Ahmed, Syed Shakeel
Lam, Tak-Wah
Luo, Ruibang
author_sort Su, Junhao
collection PubMed
description Accurate identification of genetic variants from family child–mother–father trio sequencing data is important in genomics. However, state-of-the-art approaches treat variant calling from trios as three independent tasks, which limits their calling accuracy for Nanopore long-read sequencing data. For better trio variant calling, we introduce Clair3-Trio, the first variant caller tailored for family trio data from Nanopore long-reads. Clair3-Trio employs a Trio-to-Trio deep neural network model, which allows it to input the trio sequencing information and output all of the trio’s predicted variants within a single model to improve variant calling. We also present MCVLoss, a novel loss function tailor-made for variant calling in trios, leveraging the explicit encoding of the Mendelian inheritance. Clair3-Trio showed comprehensive improvement in experiments. It predicted far fewer Mendelian inheritance violation variations than current state-of-the-art methods. We also demonstrated that our Trio-to-Trio model is more accurate than competing architectures. Clair3-Trio is accessible as a free, open-source project at https://github.com/HKU-BAL/Clair3-Trio.
format Online
Article
Text
id pubmed-9487642
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-94876422022-09-21 Clair3-trio: high-performance Nanopore long-read variant calling in family trios with trio-to-trio deep neural networks Su, Junhao Zheng, Zhenxian Ahmed, Syed Shakeel Lam, Tak-Wah Luo, Ruibang Brief Bioinform Problem Solving Protocol Accurate identification of genetic variants from family child–mother–father trio sequencing data is important in genomics. However, state-of-the-art approaches treat variant calling from trios as three independent tasks, which limits their calling accuracy for Nanopore long-read sequencing data. For better trio variant calling, we introduce Clair3-Trio, the first variant caller tailored for family trio data from Nanopore long-reads. Clair3-Trio employs a Trio-to-Trio deep neural network model, which allows it to input the trio sequencing information and output all of the trio’s predicted variants within a single model to improve variant calling. We also present MCVLoss, a novel loss function tailor-made for variant calling in trios, leveraging the explicit encoding of the Mendelian inheritance. Clair3-Trio showed comprehensive improvement in experiments. It predicted far fewer Mendelian inheritance violation variations than current state-of-the-art methods. We also demonstrated that our Trio-to-Trio model is more accurate than competing architectures. Clair3-Trio is accessible as a free, open-source project at https://github.com/HKU-BAL/Clair3-Trio. Oxford University Press 2022-07-17 /pmc/articles/PMC9487642/ /pubmed/35849103 http://dx.doi.org/10.1093/bib/bbac301 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Problem Solving Protocol
Su, Junhao
Zheng, Zhenxian
Ahmed, Syed Shakeel
Lam, Tak-Wah
Luo, Ruibang
Clair3-trio: high-performance Nanopore long-read variant calling in family trios with trio-to-trio deep neural networks
title Clair3-trio: high-performance Nanopore long-read variant calling in family trios with trio-to-trio deep neural networks
title_full Clair3-trio: high-performance Nanopore long-read variant calling in family trios with trio-to-trio deep neural networks
title_fullStr Clair3-trio: high-performance Nanopore long-read variant calling in family trios with trio-to-trio deep neural networks
title_full_unstemmed Clair3-trio: high-performance Nanopore long-read variant calling in family trios with trio-to-trio deep neural networks
title_short Clair3-trio: high-performance Nanopore long-read variant calling in family trios with trio-to-trio deep neural networks
title_sort clair3-trio: high-performance nanopore long-read variant calling in family trios with trio-to-trio deep neural networks
topic Problem Solving Protocol
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9487642/
https://www.ncbi.nlm.nih.gov/pubmed/35849103
http://dx.doi.org/10.1093/bib/bbac301
work_keys_str_mv AT sujunhao clair3triohighperformancenanoporelongreadvariantcallinginfamilytrioswithtriototriodeepneuralnetworks
AT zhengzhenxian clair3triohighperformancenanoporelongreadvariantcallinginfamilytrioswithtriototriodeepneuralnetworks
AT ahmedsyedshakeel clair3triohighperformancenanoporelongreadvariantcallinginfamilytrioswithtriototriodeepneuralnetworks
AT lamtakwah clair3triohighperformancenanoporelongreadvariantcallinginfamilytrioswithtriototriodeepneuralnetworks
AT luoruibang clair3triohighperformancenanoporelongreadvariantcallinginfamilytrioswithtriototriodeepneuralnetworks