Cargando…

Classification of non-coding variants with high pathogenic impact

Whole genome sequencing is increasingly used to diagnose medical conditions of genetic origin. While both coding and non-coding DNA variants contribute to a wide range of diseases, most patients who receive a WGS-based diagnosis today harbour a protein-coding mutation. Functional interpretation and...

Descripción completa

Detalles Bibliográficos
Autores principales: Moyon, Lambert, Berthelot, Camille, Louis, Alexandra, Nguyen, Nga Thi Thuy, Roest Crollius, Hugues
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9094564/
https://www.ncbi.nlm.nih.gov/pubmed/35486646
http://dx.doi.org/10.1371/journal.pgen.1010191
_version_ 1784705566966284288
author Moyon, Lambert
Berthelot, Camille
Louis, Alexandra
Nguyen, Nga Thi Thuy
Roest Crollius, Hugues
author_facet Moyon, Lambert
Berthelot, Camille
Louis, Alexandra
Nguyen, Nga Thi Thuy
Roest Crollius, Hugues
author_sort Moyon, Lambert
collection PubMed
description Whole genome sequencing is increasingly used to diagnose medical conditions of genetic origin. While both coding and non-coding DNA variants contribute to a wide range of diseases, most patients who receive a WGS-based diagnosis today harbour a protein-coding mutation. Functional interpretation and prioritization of non-coding variants represents a persistent challenge, and disease-causing non-coding variants remain largely unidentified. Depending on the disease, WGS fails to identify a candidate variant in 20–80% of patients, severely limiting the usefulness of sequencing for personalised medicine. Here we present FINSURF, a machine-learning approach to predict the functional impact of non-coding variants in regulatory regions. FINSURF outperforms state-of-the-art methods, owing in particular to optimized control variants selection during training. In addition to ranking candidate variants, FINSURF breaks down the score for each variant into contributions from individual annotations, facilitating the evaluation of their functional relevance. We applied FINSURF to a diverse set of 30 diseases with described causative non-coding mutations, and correctly identified the disease-causative non-coding variant within the ten top hits in 22 cases. FINSURF is implemented as an online server to as well as custom browser tracks, and provides a quick and efficient solution to prioritize candidate non-coding variants in realistic clinical settings.
format Online
Article
Text
id pubmed-9094564
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-90945642022-05-12 Classification of non-coding variants with high pathogenic impact Moyon, Lambert Berthelot, Camille Louis, Alexandra Nguyen, Nga Thi Thuy Roest Crollius, Hugues PLoS Genet Research Article Whole genome sequencing is increasingly used to diagnose medical conditions of genetic origin. While both coding and non-coding DNA variants contribute to a wide range of diseases, most patients who receive a WGS-based diagnosis today harbour a protein-coding mutation. Functional interpretation and prioritization of non-coding variants represents a persistent challenge, and disease-causing non-coding variants remain largely unidentified. Depending on the disease, WGS fails to identify a candidate variant in 20–80% of patients, severely limiting the usefulness of sequencing for personalised medicine. Here we present FINSURF, a machine-learning approach to predict the functional impact of non-coding variants in regulatory regions. FINSURF outperforms state-of-the-art methods, owing in particular to optimized control variants selection during training. In addition to ranking candidate variants, FINSURF breaks down the score for each variant into contributions from individual annotations, facilitating the evaluation of their functional relevance. We applied FINSURF to a diverse set of 30 diseases with described causative non-coding mutations, and correctly identified the disease-causative non-coding variant within the ten top hits in 22 cases. FINSURF is implemented as an online server to as well as custom browser tracks, and provides a quick and efficient solution to prioritize candidate non-coding variants in realistic clinical settings. Public Library of Science 2022-04-29 /pmc/articles/PMC9094564/ /pubmed/35486646 http://dx.doi.org/10.1371/journal.pgen.1010191 Text en © 2022 Moyon et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Moyon, Lambert
Berthelot, Camille
Louis, Alexandra
Nguyen, Nga Thi Thuy
Roest Crollius, Hugues
Classification of non-coding variants with high pathogenic impact
title Classification of non-coding variants with high pathogenic impact
title_full Classification of non-coding variants with high pathogenic impact
title_fullStr Classification of non-coding variants with high pathogenic impact
title_full_unstemmed Classification of non-coding variants with high pathogenic impact
title_short Classification of non-coding variants with high pathogenic impact
title_sort classification of non-coding variants with high pathogenic impact
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9094564/
https://www.ncbi.nlm.nih.gov/pubmed/35486646
http://dx.doi.org/10.1371/journal.pgen.1010191
work_keys_str_mv AT moyonlambert classificationofnoncodingvariantswithhighpathogenicimpact
AT berthelotcamille classificationofnoncodingvariantswithhighpathogenicimpact
AT louisalexandra classificationofnoncodingvariantswithhighpathogenicimpact
AT nguyenngathithuy classificationofnoncodingvariantswithhighpathogenicimpact
AT roestcrolliushugues classificationofnoncodingvariantswithhighpathogenicimpact