Cargando…

In silico prediction of splice-altering single nucleotide variants in the human genome

In silico tools have been developed to predict variants that may have an impact on pre-mRNA splicing. The major limitation of the application of these tools to basic research and clinical practice is the difficulty in interpreting the output. Most tools only predict potential splice sites given a DN...

Descripción completa

Detalles Bibliográficos
Autores principales: Jian, Xueqiu, Boerwinkle, Eric, Liu, Xiaoming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4267638/
https://www.ncbi.nlm.nih.gov/pubmed/25416802
http://dx.doi.org/10.1093/nar/gku1206
_version_ 1782349174044360704
author Jian, Xueqiu
Boerwinkle, Eric
Liu, Xiaoming
author_facet Jian, Xueqiu
Boerwinkle, Eric
Liu, Xiaoming
author_sort Jian, Xueqiu
collection PubMed
description In silico tools have been developed to predict variants that may have an impact on pre-mRNA splicing. The major limitation of the application of these tools to basic research and clinical practice is the difficulty in interpreting the output. Most tools only predict potential splice sites given a DNA sequence without measuring splicing signal changes caused by a variant. Another limitation is the lack of large-scale evaluation studies of these tools. We compared eight in silico tools on 2959 single nucleotide variants within splicing consensus regions (scSNVs) using receiver operating characteristic analysis. The Position Weight Matrix model and MaxEntScan outperformed other methods. Two ensemble learning methods, adaptive boosting and random forests, were used to construct models that take advantage of individual methods. Both models further improved prediction, with outputs of directly interpretable prediction scores. We applied our ensemble scores to scSNVs from the Catalogue of Somatic Mutations in Cancer database. Analysis showed that predicted splice-altering scSNVs are enriched in recurrent scSNVs and known cancer genes. We pre-computed our ensemble scores for all potential scSNVs across the human genome, providing a whole genome level resource for identifying splice-altering scSNVs discovered from large-scale sequencing studies.
format Online
Article
Text
id pubmed-4267638
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-42676382014-12-23 In silico prediction of splice-altering single nucleotide variants in the human genome Jian, Xueqiu Boerwinkle, Eric Liu, Xiaoming Nucleic Acids Res Data Resources and Analyses In silico tools have been developed to predict variants that may have an impact on pre-mRNA splicing. The major limitation of the application of these tools to basic research and clinical practice is the difficulty in interpreting the output. Most tools only predict potential splice sites given a DNA sequence without measuring splicing signal changes caused by a variant. Another limitation is the lack of large-scale evaluation studies of these tools. We compared eight in silico tools on 2959 single nucleotide variants within splicing consensus regions (scSNVs) using receiver operating characteristic analysis. The Position Weight Matrix model and MaxEntScan outperformed other methods. Two ensemble learning methods, adaptive boosting and random forests, were used to construct models that take advantage of individual methods. Both models further improved prediction, with outputs of directly interpretable prediction scores. We applied our ensemble scores to scSNVs from the Catalogue of Somatic Mutations in Cancer database. Analysis showed that predicted splice-altering scSNVs are enriched in recurrent scSNVs and known cancer genes. We pre-computed our ensemble scores for all potential scSNVs across the human genome, providing a whole genome level resource for identifying splice-altering scSNVs discovered from large-scale sequencing studies. Oxford University Press 2014-12-16 2014-11-21 /pmc/articles/PMC4267638/ /pubmed/25416802 http://dx.doi.org/10.1093/nar/gku1206 Text en © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Data Resources and Analyses
Jian, Xueqiu
Boerwinkle, Eric
Liu, Xiaoming
In silico prediction of splice-altering single nucleotide variants in the human genome
title In silico prediction of splice-altering single nucleotide variants in the human genome
title_full In silico prediction of splice-altering single nucleotide variants in the human genome
title_fullStr In silico prediction of splice-altering single nucleotide variants in the human genome
title_full_unstemmed In silico prediction of splice-altering single nucleotide variants in the human genome
title_short In silico prediction of splice-altering single nucleotide variants in the human genome
title_sort in silico prediction of splice-altering single nucleotide variants in the human genome
topic Data Resources and Analyses
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4267638/
https://www.ncbi.nlm.nih.gov/pubmed/25416802
http://dx.doi.org/10.1093/nar/gku1206
work_keys_str_mv AT jianxueqiu insilicopredictionofsplicealteringsinglenucleotidevariantsinthehumangenome
AT boerwinkleeric insilicopredictionofsplicealteringsinglenucleotidevariantsinthehumangenome
AT liuxiaoming insilicopredictionofsplicealteringsinglenucleotidevariantsinthehumangenome