Cargando…

Introme accurately predicts the impact of coding and noncoding variants on gene splicing, with clinical applications

Predicting the impact of coding and noncoding variants on splicing is challenging, particularly in non-canonical splice sites, leading to missed diagnoses in patients. Existing splice prediction tools are complementary but knowing which to use for each splicing context remains difficult. Here, we de...

Descripción completa

Detalles Bibliográficos
Autores principales: Sullivan, Patricia J., Gayevskiy, Velimir, Davis, Ryan L., Wong, Marie, Mayoh, Chelsea, Mallawaarachchi, Amali, Hort, Yvonne, McCabe, Mark J., Beecroft, Sarah, Jackson, Matilda R., Arts, Peer, Dubowsky, Andrew, Laing, Nigel, Dinger, Marcel E., Scott, Hamish S., Oates, Emily, Pinese, Mark, Cowley, Mark J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10190034/
https://www.ncbi.nlm.nih.gov/pubmed/37198692
http://dx.doi.org/10.1186/s13059-023-02936-7
_version_ 1785043204705353728
author Sullivan, Patricia J.
Gayevskiy, Velimir
Davis, Ryan L.
Wong, Marie
Mayoh, Chelsea
Mallawaarachchi, Amali
Hort, Yvonne
McCabe, Mark J.
Beecroft, Sarah
Jackson, Matilda R.
Arts, Peer
Dubowsky, Andrew
Laing, Nigel
Dinger, Marcel E.
Scott, Hamish S.
Oates, Emily
Pinese, Mark
Cowley, Mark J.
author_facet Sullivan, Patricia J.
Gayevskiy, Velimir
Davis, Ryan L.
Wong, Marie
Mayoh, Chelsea
Mallawaarachchi, Amali
Hort, Yvonne
McCabe, Mark J.
Beecroft, Sarah
Jackson, Matilda R.
Arts, Peer
Dubowsky, Andrew
Laing, Nigel
Dinger, Marcel E.
Scott, Hamish S.
Oates, Emily
Pinese, Mark
Cowley, Mark J.
author_sort Sullivan, Patricia J.
collection PubMed
description Predicting the impact of coding and noncoding variants on splicing is challenging, particularly in non-canonical splice sites, leading to missed diagnoses in patients. Existing splice prediction tools are complementary but knowing which to use for each splicing context remains difficult. Here, we describe Introme, which uses machine learning to integrate predictions from several splice detection tools, additional splicing rules, and gene architecture features to comprehensively evaluate the likelihood of a variant impacting splicing. Through extensive benchmarking across 21,000 splice-altering variants, Introme outperformed all tools (auPRC: 0.98) for the detection of clinically significant splice variants. Introme is available at https://github.com/CCICB/introme. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-023-02936-7.
format Online
Article
Text
id pubmed-10190034
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-101900342023-05-18 Introme accurately predicts the impact of coding and noncoding variants on gene splicing, with clinical applications Sullivan, Patricia J. Gayevskiy, Velimir Davis, Ryan L. Wong, Marie Mayoh, Chelsea Mallawaarachchi, Amali Hort, Yvonne McCabe, Mark J. Beecroft, Sarah Jackson, Matilda R. Arts, Peer Dubowsky, Andrew Laing, Nigel Dinger, Marcel E. Scott, Hamish S. Oates, Emily Pinese, Mark Cowley, Mark J. Genome Biol Method Predicting the impact of coding and noncoding variants on splicing is challenging, particularly in non-canonical splice sites, leading to missed diagnoses in patients. Existing splice prediction tools are complementary but knowing which to use for each splicing context remains difficult. Here, we describe Introme, which uses machine learning to integrate predictions from several splice detection tools, additional splicing rules, and gene architecture features to comprehensively evaluate the likelihood of a variant impacting splicing. Through extensive benchmarking across 21,000 splice-altering variants, Introme outperformed all tools (auPRC: 0.98) for the detection of clinically significant splice variants. Introme is available at https://github.com/CCICB/introme. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-023-02936-7. BioMed Central 2023-05-17 /pmc/articles/PMC10190034/ /pubmed/37198692 http://dx.doi.org/10.1186/s13059-023-02936-7 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Method
Sullivan, Patricia J.
Gayevskiy, Velimir
Davis, Ryan L.
Wong, Marie
Mayoh, Chelsea
Mallawaarachchi, Amali
Hort, Yvonne
McCabe, Mark J.
Beecroft, Sarah
Jackson, Matilda R.
Arts, Peer
Dubowsky, Andrew
Laing, Nigel
Dinger, Marcel E.
Scott, Hamish S.
Oates, Emily
Pinese, Mark
Cowley, Mark J.
Introme accurately predicts the impact of coding and noncoding variants on gene splicing, with clinical applications
title Introme accurately predicts the impact of coding and noncoding variants on gene splicing, with clinical applications
title_full Introme accurately predicts the impact of coding and noncoding variants on gene splicing, with clinical applications
title_fullStr Introme accurately predicts the impact of coding and noncoding variants on gene splicing, with clinical applications
title_full_unstemmed Introme accurately predicts the impact of coding and noncoding variants on gene splicing, with clinical applications
title_short Introme accurately predicts the impact of coding and noncoding variants on gene splicing, with clinical applications
title_sort introme accurately predicts the impact of coding and noncoding variants on gene splicing, with clinical applications
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10190034/
https://www.ncbi.nlm.nih.gov/pubmed/37198692
http://dx.doi.org/10.1186/s13059-023-02936-7
work_keys_str_mv AT sullivanpatriciaj intromeaccuratelypredictstheimpactofcodingandnoncodingvariantsongenesplicingwithclinicalapplications
AT gayevskiyvelimir intromeaccuratelypredictstheimpactofcodingandnoncodingvariantsongenesplicingwithclinicalapplications
AT davisryanl intromeaccuratelypredictstheimpactofcodingandnoncodingvariantsongenesplicingwithclinicalapplications
AT wongmarie intromeaccuratelypredictstheimpactofcodingandnoncodingvariantsongenesplicingwithclinicalapplications
AT mayohchelsea intromeaccuratelypredictstheimpactofcodingandnoncodingvariantsongenesplicingwithclinicalapplications
AT mallawaarachchiamali intromeaccuratelypredictstheimpactofcodingandnoncodingvariantsongenesplicingwithclinicalapplications
AT hortyvonne intromeaccuratelypredictstheimpactofcodingandnoncodingvariantsongenesplicingwithclinicalapplications
AT mccabemarkj intromeaccuratelypredictstheimpactofcodingandnoncodingvariantsongenesplicingwithclinicalapplications
AT beecroftsarah intromeaccuratelypredictstheimpactofcodingandnoncodingvariantsongenesplicingwithclinicalapplications
AT jacksonmatildar intromeaccuratelypredictstheimpactofcodingandnoncodingvariantsongenesplicingwithclinicalapplications
AT artspeer intromeaccuratelypredictstheimpactofcodingandnoncodingvariantsongenesplicingwithclinicalapplications
AT dubowskyandrew intromeaccuratelypredictstheimpactofcodingandnoncodingvariantsongenesplicingwithclinicalapplications
AT laingnigel intromeaccuratelypredictstheimpactofcodingandnoncodingvariantsongenesplicingwithclinicalapplications
AT dingermarcele intromeaccuratelypredictstheimpactofcodingandnoncodingvariantsongenesplicingwithclinicalapplications
AT scotthamishs intromeaccuratelypredictstheimpactofcodingandnoncodingvariantsongenesplicingwithclinicalapplications
AT oatesemily intromeaccuratelypredictstheimpactofcodingandnoncodingvariantsongenesplicingwithclinicalapplications
AT pinesemark intromeaccuratelypredictstheimpactofcodingandnoncodingvariantsongenesplicingwithclinicalapplications
AT cowleymarkj intromeaccuratelypredictstheimpactofcodingandnoncodingvariantsongenesplicingwithclinicalapplications