Cargando…
TiSAn: estimating tissue-specific effects of coding and non-coding variants
MOTIVATION: Model-based estimates of general deleteriousness, like CADD, DANN or PolyPhen, have become indispensable tools in the interpretation of genetic variants. However, these approaches say little about the tissues in which the effects of deleterious variants will be most meaningful. Tissue-sp...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6137979/ https://www.ncbi.nlm.nih.gov/pubmed/29912365 http://dx.doi.org/10.1093/bioinformatics/bty301 |
_version_ | 1783355269849284608 |
---|---|
author | Vervier, Kévin Michaelson, Jacob J |
author_facet | Vervier, Kévin Michaelson, Jacob J |
author_sort | Vervier, Kévin |
collection | PubMed |
description | MOTIVATION: Model-based estimates of general deleteriousness, like CADD, DANN or PolyPhen, have become indispensable tools in the interpretation of genetic variants. However, these approaches say little about the tissues in which the effects of deleterious variants will be most meaningful. Tissue-specific annotations have been recently inferred for dozens of tissues/cell types from large collections of cross-tissue epigenomic data, and have demonstrated sensitivity in predicting affected tissues in complex traits. It remains unclear, however, whether including additional genome-scale data specific to the tissue of interest would appreciably improve functional annotations. RESULTS: Herein, we introduce TiSAn, a tool that integrates multiple genome-scale data sources, defined by expert knowledge. TiSAn uses machine learning to discriminate variants relevant to a tissue from those with no bearing on the function of that tissue. Predictions are made genome-wide, and can be used to contextualize and filter variants of interest in whole genome sequencing or genome-wide association studies. We demonstrate the accuracy and flexibility of TiSAn by producing predictive models for human heart and brain, and detecting tissue-relevant variations in large cohorts for autism spectrum disorder (TiSAn-brain) and coronary artery disease (TiSAn-heart). We find the multiomics TiSAn model is better able to prioritize genetic variants according to their tissue-specific action than the current state-of-the-art method, GenoSkyLine. AVAILABILITY AND IMPLEMENTATION: Software and vignettes are available at http://github.com/kevinVervier/TiSAn. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-6137979 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-61379792018-09-24 TiSAn: estimating tissue-specific effects of coding and non-coding variants Vervier, Kévin Michaelson, Jacob J Bioinformatics Discovery Note MOTIVATION: Model-based estimates of general deleteriousness, like CADD, DANN or PolyPhen, have become indispensable tools in the interpretation of genetic variants. However, these approaches say little about the tissues in which the effects of deleterious variants will be most meaningful. Tissue-specific annotations have been recently inferred for dozens of tissues/cell types from large collections of cross-tissue epigenomic data, and have demonstrated sensitivity in predicting affected tissues in complex traits. It remains unclear, however, whether including additional genome-scale data specific to the tissue of interest would appreciably improve functional annotations. RESULTS: Herein, we introduce TiSAn, a tool that integrates multiple genome-scale data sources, defined by expert knowledge. TiSAn uses machine learning to discriminate variants relevant to a tissue from those with no bearing on the function of that tissue. Predictions are made genome-wide, and can be used to contextualize and filter variants of interest in whole genome sequencing or genome-wide association studies. We demonstrate the accuracy and flexibility of TiSAn by producing predictive models for human heart and brain, and detecting tissue-relevant variations in large cohorts for autism spectrum disorder (TiSAn-brain) and coronary artery disease (TiSAn-heart). We find the multiomics TiSAn model is better able to prioritize genetic variants according to their tissue-specific action than the current state-of-the-art method, GenoSkyLine. AVAILABILITY AND IMPLEMENTATION: Software and vignettes are available at http://github.com/kevinVervier/TiSAn. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2018-09-15 2018-04-18 /pmc/articles/PMC6137979/ /pubmed/29912365 http://dx.doi.org/10.1093/bioinformatics/bty301 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Discovery Note Vervier, Kévin Michaelson, Jacob J TiSAn: estimating tissue-specific effects of coding and non-coding variants |
title | TiSAn: estimating tissue-specific effects of coding and non-coding variants |
title_full | TiSAn: estimating tissue-specific effects of coding and non-coding variants |
title_fullStr | TiSAn: estimating tissue-specific effects of coding and non-coding variants |
title_full_unstemmed | TiSAn: estimating tissue-specific effects of coding and non-coding variants |
title_short | TiSAn: estimating tissue-specific effects of coding and non-coding variants |
title_sort | tisan: estimating tissue-specific effects of coding and non-coding variants |
topic | Discovery Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6137979/ https://www.ncbi.nlm.nih.gov/pubmed/29912365 http://dx.doi.org/10.1093/bioinformatics/bty301 |
work_keys_str_mv | AT vervierkevin tisanestimatingtissuespecificeffectsofcodingandnoncodingvariants AT michaelsonjacobj tisanestimatingtissuespecificeffectsofcodingandnoncodingvariants |