Cargando…

3Cnet: pathogenicity prediction of human variants using multitask learning with evolutionary constraints

MOTIVATION: Improvements in next-generation sequencing have enabled genome-based diagnosis for patients with genetic diseases. However, accurate interpretation of human variants requires knowledge from a number of clinical cases. In addition, manual analysis of each variant detected in a patient...

Descripción completa

Detalles Bibliográficos
Autores principales: Won, Dhong-Gun, Kim, Dong-Wook, Woo, Junwoo, Lee, Kyoungyeul
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8665754/
https://www.ncbi.nlm.nih.gov/pubmed/34270679
http://dx.doi.org/10.1093/bioinformatics/btab529
_version_ 1784614074286342144
author Won, Dhong-Gun
Kim, Dong-Wook
Woo, Junwoo
Lee, Kyoungyeul
author_facet Won, Dhong-Gun
Kim, Dong-Wook
Woo, Junwoo
Lee, Kyoungyeul
author_sort Won, Dhong-Gun
collection PubMed
description MOTIVATION: Improvements in next-generation sequencing have enabled genome-based diagnosis for patients with genetic diseases. However, accurate interpretation of human variants requires knowledge from a number of clinical cases. In addition, manual analysis of each variant detected in a patient's genome requires enormous time and effort. To reduce the cost of diagnosis, various computational tools have been developed to predict the pathogenicity of human variants, but the shortage and bias of available clinical data can lead to overfitting of algorithms. RESULTS: We developed a pathogenicity predictor, 3Cnet, that uses recurrent neural networks to analyze the amino acid context of human variants. As 3Cnet is trained on simulated variants reflecting evolutionary conservation and clinical data, it can find disease-causing variants in patient genomes with 2.2 times greater sensitivity than currently available tools, more effectively discovering pathogenic variants and thereby improving diagnosis rates. AVAILABILITY AND IMPLEMENTATION: Codes (https://github.com/KyoungYeulLee/3Cnet/) and data (https://zenodo.org/record/4716879#.YIO-xqkzZH1) are freely available to non-commercial users. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-8665754
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-86657542021-12-13 3Cnet: pathogenicity prediction of human variants using multitask learning with evolutionary constraints Won, Dhong-Gun Kim, Dong-Wook Woo, Junwoo Lee, Kyoungyeul Bioinformatics Original Papers MOTIVATION: Improvements in next-generation sequencing have enabled genome-based diagnosis for patients with genetic diseases. However, accurate interpretation of human variants requires knowledge from a number of clinical cases. In addition, manual analysis of each variant detected in a patient's genome requires enormous time and effort. To reduce the cost of diagnosis, various computational tools have been developed to predict the pathogenicity of human variants, but the shortage and bias of available clinical data can lead to overfitting of algorithms. RESULTS: We developed a pathogenicity predictor, 3Cnet, that uses recurrent neural networks to analyze the amino acid context of human variants. As 3Cnet is trained on simulated variants reflecting evolutionary conservation and clinical data, it can find disease-causing variants in patient genomes with 2.2 times greater sensitivity than currently available tools, more effectively discovering pathogenic variants and thereby improving diagnosis rates. AVAILABILITY AND IMPLEMENTATION: Codes (https://github.com/KyoungYeulLee/3Cnet/) and data (https://zenodo.org/record/4716879#.YIO-xqkzZH1) are freely available to non-commercial users. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2021-07-16 /pmc/articles/PMC8665754/ /pubmed/34270679 http://dx.doi.org/10.1093/bioinformatics/btab529 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Won, Dhong-Gun
Kim, Dong-Wook
Woo, Junwoo
Lee, Kyoungyeul
3Cnet: pathogenicity prediction of human variants using multitask learning with evolutionary constraints
title 3Cnet: pathogenicity prediction of human variants using multitask learning with evolutionary constraints
title_full 3Cnet: pathogenicity prediction of human variants using multitask learning with evolutionary constraints
title_fullStr 3Cnet: pathogenicity prediction of human variants using multitask learning with evolutionary constraints
title_full_unstemmed 3Cnet: pathogenicity prediction of human variants using multitask learning with evolutionary constraints
title_short 3Cnet: pathogenicity prediction of human variants using multitask learning with evolutionary constraints
title_sort 3cnet: pathogenicity prediction of human variants using multitask learning with evolutionary constraints
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8665754/
https://www.ncbi.nlm.nih.gov/pubmed/34270679
http://dx.doi.org/10.1093/bioinformatics/btab529
work_keys_str_mv AT wondhonggun 3cnetpathogenicitypredictionofhumanvariantsusingmultitasklearningwithevolutionaryconstraints
AT kimdongwook 3cnetpathogenicitypredictionofhumanvariantsusingmultitasklearningwithevolutionaryconstraints
AT woojunwoo 3cnetpathogenicitypredictionofhumanvariantsusingmultitasklearningwithevolutionaryconstraints
AT leekyoungyeul 3cnetpathogenicitypredictionofhumanvariantsusingmultitasklearningwithevolutionaryconstraints