Cargando…

TiSAn: estimating tissue-specific effects of coding and non-coding variants

MOTIVATION: Model-based estimates of general deleteriousness, like CADD, DANN or PolyPhen, have become indispensable tools in the interpretation of genetic variants. However, these approaches say little about the tissues in which the effects of deleterious variants will be most meaningful. Tissue-sp...

Descripción completa

Detalles Bibliográficos
Autores principales: Vervier, Kévin, Michaelson, Jacob J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6137979/
https://www.ncbi.nlm.nih.gov/pubmed/29912365
http://dx.doi.org/10.1093/bioinformatics/bty301
_version_ 1783355269849284608
author Vervier, Kévin
Michaelson, Jacob J
author_facet Vervier, Kévin
Michaelson, Jacob J
author_sort Vervier, Kévin
collection PubMed
description MOTIVATION: Model-based estimates of general deleteriousness, like CADD, DANN or PolyPhen, have become indispensable tools in the interpretation of genetic variants. However, these approaches say little about the tissues in which the effects of deleterious variants will be most meaningful. Tissue-specific annotations have been recently inferred for dozens of tissues/cell types from large collections of cross-tissue epigenomic data, and have demonstrated sensitivity in predicting affected tissues in complex traits. It remains unclear, however, whether including additional genome-scale data specific to the tissue of interest would appreciably improve functional annotations. RESULTS: Herein, we introduce TiSAn, a tool that integrates multiple genome-scale data sources, defined by expert knowledge. TiSAn uses machine learning to discriminate variants relevant to a tissue from those with no bearing on the function of that tissue. Predictions are made genome-wide, and can be used to contextualize and filter variants of interest in whole genome sequencing or genome-wide association studies. We demonstrate the accuracy and flexibility of TiSAn by producing predictive models for human heart and brain, and detecting tissue-relevant variations in large cohorts for autism spectrum disorder (TiSAn-brain) and coronary artery disease (TiSAn-heart). We find the multiomics TiSAn model is better able to prioritize genetic variants according to their tissue-specific action than the current state-of-the-art method, GenoSkyLine. AVAILABILITY AND IMPLEMENTATION: Software and vignettes are available at http://github.com/kevinVervier/TiSAn. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-6137979
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-61379792018-09-24 TiSAn: estimating tissue-specific effects of coding and non-coding variants Vervier, Kévin Michaelson, Jacob J Bioinformatics Discovery Note MOTIVATION: Model-based estimates of general deleteriousness, like CADD, DANN or PolyPhen, have become indispensable tools in the interpretation of genetic variants. However, these approaches say little about the tissues in which the effects of deleterious variants will be most meaningful. Tissue-specific annotations have been recently inferred for dozens of tissues/cell types from large collections of cross-tissue epigenomic data, and have demonstrated sensitivity in predicting affected tissues in complex traits. It remains unclear, however, whether including additional genome-scale data specific to the tissue of interest would appreciably improve functional annotations. RESULTS: Herein, we introduce TiSAn, a tool that integrates multiple genome-scale data sources, defined by expert knowledge. TiSAn uses machine learning to discriminate variants relevant to a tissue from those with no bearing on the function of that tissue. Predictions are made genome-wide, and can be used to contextualize and filter variants of interest in whole genome sequencing or genome-wide association studies. We demonstrate the accuracy and flexibility of TiSAn by producing predictive models for human heart and brain, and detecting tissue-relevant variations in large cohorts for autism spectrum disorder (TiSAn-brain) and coronary artery disease (TiSAn-heart). We find the multiomics TiSAn model is better able to prioritize genetic variants according to their tissue-specific action than the current state-of-the-art method, GenoSkyLine. AVAILABILITY AND IMPLEMENTATION: Software and vignettes are available at http://github.com/kevinVervier/TiSAn. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2018-09-15 2018-04-18 /pmc/articles/PMC6137979/ /pubmed/29912365 http://dx.doi.org/10.1093/bioinformatics/bty301 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Discovery Note
Vervier, Kévin
Michaelson, Jacob J
TiSAn: estimating tissue-specific effects of coding and non-coding variants
title TiSAn: estimating tissue-specific effects of coding and non-coding variants
title_full TiSAn: estimating tissue-specific effects of coding and non-coding variants
title_fullStr TiSAn: estimating tissue-specific effects of coding and non-coding variants
title_full_unstemmed TiSAn: estimating tissue-specific effects of coding and non-coding variants
title_short TiSAn: estimating tissue-specific effects of coding and non-coding variants
title_sort tisan: estimating tissue-specific effects of coding and non-coding variants
topic Discovery Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6137979/
https://www.ncbi.nlm.nih.gov/pubmed/29912365
http://dx.doi.org/10.1093/bioinformatics/bty301
work_keys_str_mv AT vervierkevin tisanestimatingtissuespecificeffectsofcodingandnoncodingvariants
AT michaelsonjacobj tisanestimatingtissuespecificeffectsofcodingandnoncodingvariants