Cargando…

Interpretation, Stratification and Evidence for Sequence Variants Affecting mRNA Splicing in Complete Human Genome Sequences

Information theory-based methods have been shown to be sensitive and specific for predicting and quantifying the effects of non-coding mutations in Mendelian diseases. We present the Shannon pipeline software for genome-scale mutation analysis and provide evidence that the software predicts variants...

Descripción completa

Detalles Bibliográficos
Autores principales: Shirley, Ben C., Mucaki, Eliseos J., Whitehead, Tyson, Costea, Paul I., Akan, Pelin, Rogan, Peter K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4357664/
https://www.ncbi.nlm.nih.gov/pubmed/23499923
http://dx.doi.org/10.1016/j.gpb.2013.01.008
_version_ 1782361175182278656
author Shirley, Ben C.
Mucaki, Eliseos J.
Whitehead, Tyson
Costea, Paul I.
Akan, Pelin
Rogan, Peter K.
author_facet Shirley, Ben C.
Mucaki, Eliseos J.
Whitehead, Tyson
Costea, Paul I.
Akan, Pelin
Rogan, Peter K.
author_sort Shirley, Ben C.
collection PubMed
description Information theory-based methods have been shown to be sensitive and specific for predicting and quantifying the effects of non-coding mutations in Mendelian diseases. We present the Shannon pipeline software for genome-scale mutation analysis and provide evidence that the software predicts variants affecting mRNA splicing. Individual information contents (in bits) of reference and variant splice sites are compared and significant differences are annotated and prioritized. The software has been implemented for CLC-Bio Genomics platform. Annotation indicates the context of novel mutations as well as common and rare SNPs with splicing effects. Potential natural and cryptic mRNA splicing variants are identified, and null mutations are distinguished from leaky mutations. Mutations and rare SNPs were predicted in genomes of three cancer cell lines (U2OS, U251 and A431), which were supported by expression analyses. After filtering, tractable numbers of potentially deleterious variants are predicted by the software, suitable for further laboratory investigation. In these cell lines, novel functional variants comprised 6–17 inactivating mutations, 1–5 leaky mutations and 6–13 cryptic splicing mutations. Predicted effects were validated by RNA-seq analysis of the three aforementioned cancer cell lines, and expression microarray analysis of SNPs in HapMap cell lines.
format Online
Article
Text
id pubmed-4357664
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-43576642015-05-06 Interpretation, Stratification and Evidence for Sequence Variants Affecting mRNA Splicing in Complete Human Genome Sequences Shirley, Ben C. Mucaki, Eliseos J. Whitehead, Tyson Costea, Paul I. Akan, Pelin Rogan, Peter K. Genomics Proteomics Bioinformatics Original Research Information theory-based methods have been shown to be sensitive and specific for predicting and quantifying the effects of non-coding mutations in Mendelian diseases. We present the Shannon pipeline software for genome-scale mutation analysis and provide evidence that the software predicts variants affecting mRNA splicing. Individual information contents (in bits) of reference and variant splice sites are compared and significant differences are annotated and prioritized. The software has been implemented for CLC-Bio Genomics platform. Annotation indicates the context of novel mutations as well as common and rare SNPs with splicing effects. Potential natural and cryptic mRNA splicing variants are identified, and null mutations are distinguished from leaky mutations. Mutations and rare SNPs were predicted in genomes of three cancer cell lines (U2OS, U251 and A431), which were supported by expression analyses. After filtering, tractable numbers of potentially deleterious variants are predicted by the software, suitable for further laboratory investigation. In these cell lines, novel functional variants comprised 6–17 inactivating mutations, 1–5 leaky mutations and 6–13 cryptic splicing mutations. Predicted effects were validated by RNA-seq analysis of the three aforementioned cancer cell lines, and expression microarray analysis of SNPs in HapMap cell lines. Elsevier 2013-04 2013-03-14 /pmc/articles/PMC4357664/ /pubmed/23499923 http://dx.doi.org/10.1016/j.gpb.2013.01.008 Text en © 2013 Beijing Institute of Genomics, Chinese Academy of Sciences and Genetics Society of China. Production and hosting by Elsevier B.V. All rights reserved. http://creativecommons.org/licenses/by-nc-sa/3.0/ This is an open access article under the CC BY-NC-SA license (http://creativecommons.org/licenses/by-nc-sa/3.0/).
spellingShingle Original Research
Shirley, Ben C.
Mucaki, Eliseos J.
Whitehead, Tyson
Costea, Paul I.
Akan, Pelin
Rogan, Peter K.
Interpretation, Stratification and Evidence for Sequence Variants Affecting mRNA Splicing in Complete Human Genome Sequences
title Interpretation, Stratification and Evidence for Sequence Variants Affecting mRNA Splicing in Complete Human Genome Sequences
title_full Interpretation, Stratification and Evidence for Sequence Variants Affecting mRNA Splicing in Complete Human Genome Sequences
title_fullStr Interpretation, Stratification and Evidence for Sequence Variants Affecting mRNA Splicing in Complete Human Genome Sequences
title_full_unstemmed Interpretation, Stratification and Evidence for Sequence Variants Affecting mRNA Splicing in Complete Human Genome Sequences
title_short Interpretation, Stratification and Evidence for Sequence Variants Affecting mRNA Splicing in Complete Human Genome Sequences
title_sort interpretation, stratification and evidence for sequence variants affecting mrna splicing in complete human genome sequences
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4357664/
https://www.ncbi.nlm.nih.gov/pubmed/23499923
http://dx.doi.org/10.1016/j.gpb.2013.01.008
work_keys_str_mv AT shirleybenc interpretationstratificationandevidenceforsequencevariantsaffectingmrnasplicingincompletehumangenomesequences
AT mucakieliseosj interpretationstratificationandevidenceforsequencevariantsaffectingmrnasplicingincompletehumangenomesequences
AT whiteheadtyson interpretationstratificationandevidenceforsequencevariantsaffectingmrnasplicingincompletehumangenomesequences
AT costeapauli interpretationstratificationandevidenceforsequencevariantsaffectingmrnasplicingincompletehumangenomesequences
AT akanpelin interpretationstratificationandevidenceforsequencevariantsaffectingmrnasplicingincompletehumangenomesequences
AT roganpeterk interpretationstratificationandevidenceforsequencevariantsaffectingmrnasplicingincompletehumangenomesequences