Cargando…
Clair3-trio: high-performance Nanopore long-read variant calling in family trios with trio-to-trio deep neural networks
Accurate identification of genetic variants from family child–mother–father trio sequencing data is important in genomics. However, state-of-the-art approaches treat variant calling from trios as three independent tasks, which limits their calling accuracy for Nanopore long-read sequencing data. For...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9487642/ https://www.ncbi.nlm.nih.gov/pubmed/35849103 http://dx.doi.org/10.1093/bib/bbac301 |
_version_ | 1784792496564338688 |
---|---|
author | Su, Junhao Zheng, Zhenxian Ahmed, Syed Shakeel Lam, Tak-Wah Luo, Ruibang |
author_facet | Su, Junhao Zheng, Zhenxian Ahmed, Syed Shakeel Lam, Tak-Wah Luo, Ruibang |
author_sort | Su, Junhao |
collection | PubMed |
description | Accurate identification of genetic variants from family child–mother–father trio sequencing data is important in genomics. However, state-of-the-art approaches treat variant calling from trios as three independent tasks, which limits their calling accuracy for Nanopore long-read sequencing data. For better trio variant calling, we introduce Clair3-Trio, the first variant caller tailored for family trio data from Nanopore long-reads. Clair3-Trio employs a Trio-to-Trio deep neural network model, which allows it to input the trio sequencing information and output all of the trio’s predicted variants within a single model to improve variant calling. We also present MCVLoss, a novel loss function tailor-made for variant calling in trios, leveraging the explicit encoding of the Mendelian inheritance. Clair3-Trio showed comprehensive improvement in experiments. It predicted far fewer Mendelian inheritance violation variations than current state-of-the-art methods. We also demonstrated that our Trio-to-Trio model is more accurate than competing architectures. Clair3-Trio is accessible as a free, open-source project at https://github.com/HKU-BAL/Clair3-Trio. |
format | Online Article Text |
id | pubmed-9487642 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-94876422022-09-21 Clair3-trio: high-performance Nanopore long-read variant calling in family trios with trio-to-trio deep neural networks Su, Junhao Zheng, Zhenxian Ahmed, Syed Shakeel Lam, Tak-Wah Luo, Ruibang Brief Bioinform Problem Solving Protocol Accurate identification of genetic variants from family child–mother–father trio sequencing data is important in genomics. However, state-of-the-art approaches treat variant calling from trios as three independent tasks, which limits their calling accuracy for Nanopore long-read sequencing data. For better trio variant calling, we introduce Clair3-Trio, the first variant caller tailored for family trio data from Nanopore long-reads. Clair3-Trio employs a Trio-to-Trio deep neural network model, which allows it to input the trio sequencing information and output all of the trio’s predicted variants within a single model to improve variant calling. We also present MCVLoss, a novel loss function tailor-made for variant calling in trios, leveraging the explicit encoding of the Mendelian inheritance. Clair3-Trio showed comprehensive improvement in experiments. It predicted far fewer Mendelian inheritance violation variations than current state-of-the-art methods. We also demonstrated that our Trio-to-Trio model is more accurate than competing architectures. Clair3-Trio is accessible as a free, open-source project at https://github.com/HKU-BAL/Clair3-Trio. Oxford University Press 2022-07-17 /pmc/articles/PMC9487642/ /pubmed/35849103 http://dx.doi.org/10.1093/bib/bbac301 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Problem Solving Protocol Su, Junhao Zheng, Zhenxian Ahmed, Syed Shakeel Lam, Tak-Wah Luo, Ruibang Clair3-trio: high-performance Nanopore long-read variant calling in family trios with trio-to-trio deep neural networks |
title | Clair3-trio: high-performance Nanopore long-read variant calling in family trios with trio-to-trio deep neural networks |
title_full | Clair3-trio: high-performance Nanopore long-read variant calling in family trios with trio-to-trio deep neural networks |
title_fullStr | Clair3-trio: high-performance Nanopore long-read variant calling in family trios with trio-to-trio deep neural networks |
title_full_unstemmed | Clair3-trio: high-performance Nanopore long-read variant calling in family trios with trio-to-trio deep neural networks |
title_short | Clair3-trio: high-performance Nanopore long-read variant calling in family trios with trio-to-trio deep neural networks |
title_sort | clair3-trio: high-performance nanopore long-read variant calling in family trios with trio-to-trio deep neural networks |
topic | Problem Solving Protocol |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9487642/ https://www.ncbi.nlm.nih.gov/pubmed/35849103 http://dx.doi.org/10.1093/bib/bbac301 |
work_keys_str_mv | AT sujunhao clair3triohighperformancenanoporelongreadvariantcallinginfamilytrioswithtriototriodeepneuralnetworks AT zhengzhenxian clair3triohighperformancenanoporelongreadvariantcallinginfamilytrioswithtriototriodeepneuralnetworks AT ahmedsyedshakeel clair3triohighperformancenanoporelongreadvariantcallinginfamilytrioswithtriototriodeepneuralnetworks AT lamtakwah clair3triohighperformancenanoporelongreadvariantcallinginfamilytrioswithtriototriodeepneuralnetworks AT luoruibang clair3triohighperformancenanoporelongreadvariantcallinginfamilytrioswithtriototriodeepneuralnetworks |