Cargando…

Comparison of three bioinformatics tools in the detection of ASD candidate variants from whole exome sequencing data

Autism spectrum disorder (ASD) is a heterogenous multifactorial neurodevelopmental condition with a significant genetic susceptibility component. Thus, identifying genetic variations associated with ASD is a complex task. Whole-exome sequencing (WES) is an effective approach for detecting extremely...

Descripción completa

Detalles Bibliográficos
Autores principales: Shil, Apurba, Levin, Liron, Golan, Hava, Meiri, Gal, Michaelovski, Analya, Sadaka, Yair, Aran, Adi, Dinstein, Ilan, Menashe, Idan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10620213/
https://www.ncbi.nlm.nih.gov/pubmed/37914828
http://dx.doi.org/10.1038/s41598-023-46258-x
_version_ 1785130157981302784
author Shil, Apurba
Levin, Liron
Golan, Hava
Meiri, Gal
Michaelovski, Analya
Sadaka, Yair
Aran, Adi
Dinstein, Ilan
Menashe, Idan
author_facet Shil, Apurba
Levin, Liron
Golan, Hava
Meiri, Gal
Michaelovski, Analya
Sadaka, Yair
Aran, Adi
Dinstein, Ilan
Menashe, Idan
author_sort Shil, Apurba
collection PubMed
description Autism spectrum disorder (ASD) is a heterogenous multifactorial neurodevelopmental condition with a significant genetic susceptibility component. Thus, identifying genetic variations associated with ASD is a complex task. Whole-exome sequencing (WES) is an effective approach for detecting extremely rare protein-coding single-nucleotide variants (SNVs) and short insertions/deletions (INDELs). However, interpreting these variants' functional and clinical consequences requires integrating multifaceted genomic information. We compared the concordance and effectiveness of three bioinformatics tools in detecting ASD candidate variants (SNVs and short INDELs) from WES data of 220 ASD family trios registered in the National Autism Database of Israel. We studied only rare (< 1% population frequency) proband-specific variants. According to the American College of Medical Genetics (ACMG) guidelines, the pathogenicity of variants was evaluated by the InterVar and TAPES tools. In addition, likely gene-disrupting (LGD) variants were detected based on an in-house bioinformatics tool, Psi-Variant, that integrates results from seven in-silico prediction tools. Overall, 372 variants in 311 genes distributed in 168 probands were detected by these tools. The overlap between the tools was 64.1, 22.9, and 23.1% for InterVar–TAPES, InterVar–Psi-Variant, and TAPES–Psi-Variant, respectively. The intersection between InterVar and Psi-Variant (I ∩ P) was the most effective approach in detecting variants in known ASD genes (PPV = 0.274; OR = 7.09, 95% CI = 3.92–12.22), while the union of InterVar and Psi Variant (I U P) achieved the highest diagnostic yield (20.5%).Our results suggest that integrating different variant interpretation approaches in detecting ASD candidate variants from WES data is superior to each approach alone. The inclusion of additional criteria could further improve the detection of ASD candidate variants.
format Online
Article
Text
id pubmed-10620213
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-106202132023-11-03 Comparison of three bioinformatics tools in the detection of ASD candidate variants from whole exome sequencing data Shil, Apurba Levin, Liron Golan, Hava Meiri, Gal Michaelovski, Analya Sadaka, Yair Aran, Adi Dinstein, Ilan Menashe, Idan Sci Rep Article Autism spectrum disorder (ASD) is a heterogenous multifactorial neurodevelopmental condition with a significant genetic susceptibility component. Thus, identifying genetic variations associated with ASD is a complex task. Whole-exome sequencing (WES) is an effective approach for detecting extremely rare protein-coding single-nucleotide variants (SNVs) and short insertions/deletions (INDELs). However, interpreting these variants' functional and clinical consequences requires integrating multifaceted genomic information. We compared the concordance and effectiveness of three bioinformatics tools in detecting ASD candidate variants (SNVs and short INDELs) from WES data of 220 ASD family trios registered in the National Autism Database of Israel. We studied only rare (< 1% population frequency) proband-specific variants. According to the American College of Medical Genetics (ACMG) guidelines, the pathogenicity of variants was evaluated by the InterVar and TAPES tools. In addition, likely gene-disrupting (LGD) variants were detected based on an in-house bioinformatics tool, Psi-Variant, that integrates results from seven in-silico prediction tools. Overall, 372 variants in 311 genes distributed in 168 probands were detected by these tools. The overlap between the tools was 64.1, 22.9, and 23.1% for InterVar–TAPES, InterVar–Psi-Variant, and TAPES–Psi-Variant, respectively. The intersection between InterVar and Psi-Variant (I ∩ P) was the most effective approach in detecting variants in known ASD genes (PPV = 0.274; OR = 7.09, 95% CI = 3.92–12.22), while the union of InterVar and Psi Variant (I U P) achieved the highest diagnostic yield (20.5%).Our results suggest that integrating different variant interpretation approaches in detecting ASD candidate variants from WES data is superior to each approach alone. The inclusion of additional criteria could further improve the detection of ASD candidate variants. Nature Publishing Group UK 2023-11-01 /pmc/articles/PMC10620213/ /pubmed/37914828 http://dx.doi.org/10.1038/s41598-023-46258-x Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Shil, Apurba
Levin, Liron
Golan, Hava
Meiri, Gal
Michaelovski, Analya
Sadaka, Yair
Aran, Adi
Dinstein, Ilan
Menashe, Idan
Comparison of three bioinformatics tools in the detection of ASD candidate variants from whole exome sequencing data
title Comparison of three bioinformatics tools in the detection of ASD candidate variants from whole exome sequencing data
title_full Comparison of three bioinformatics tools in the detection of ASD candidate variants from whole exome sequencing data
title_fullStr Comparison of three bioinformatics tools in the detection of ASD candidate variants from whole exome sequencing data
title_full_unstemmed Comparison of three bioinformatics tools in the detection of ASD candidate variants from whole exome sequencing data
title_short Comparison of three bioinformatics tools in the detection of ASD candidate variants from whole exome sequencing data
title_sort comparison of three bioinformatics tools in the detection of asd candidate variants from whole exome sequencing data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10620213/
https://www.ncbi.nlm.nih.gov/pubmed/37914828
http://dx.doi.org/10.1038/s41598-023-46258-x
work_keys_str_mv AT shilapurba comparisonofthreebioinformaticstoolsinthedetectionofasdcandidatevariantsfromwholeexomesequencingdata
AT levinliron comparisonofthreebioinformaticstoolsinthedetectionofasdcandidatevariantsfromwholeexomesequencingdata
AT golanhava comparisonofthreebioinformaticstoolsinthedetectionofasdcandidatevariantsfromwholeexomesequencingdata
AT meirigal comparisonofthreebioinformaticstoolsinthedetectionofasdcandidatevariantsfromwholeexomesequencingdata
AT michaelovskianalya comparisonofthreebioinformaticstoolsinthedetectionofasdcandidatevariantsfromwholeexomesequencingdata
AT sadakayair comparisonofthreebioinformaticstoolsinthedetectionofasdcandidatevariantsfromwholeexomesequencingdata
AT aranadi comparisonofthreebioinformaticstoolsinthedetectionofasdcandidatevariantsfromwholeexomesequencingdata
AT dinsteinilan comparisonofthreebioinformaticstoolsinthedetectionofasdcandidatevariantsfromwholeexomesequencingdata
AT menasheidan comparisonofthreebioinformaticstoolsinthedetectionofasdcandidatevariantsfromwholeexomesequencingdata