Cargando…
Scalable approaches for functional analyses of whole-genome sequencing non-coding variants
Non-coding genetic variants outside of protein-coding genome regions play an important role in genetic and epigenetic regulation. It has become increasingly important to understand their roles, as non-coding variants often make up the majority of top findings of genome-wide association studies (GWAS...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9585666/ https://www.ncbi.nlm.nih.gov/pubmed/35943817 http://dx.doi.org/10.1093/hmg/ddac191 |
_version_ | 1784813542588809216 |
---|---|
author | Kuksa, Pavel P Greenfest-Allen, Emily Cifello, Jeffrey Ionita, Matei Wang, Hui Nicaretta, Heather Cheng, Po-Liang Lee, Wan-Ping Wang, Li-San Leung, Yuk Yee |
author_facet | Kuksa, Pavel P Greenfest-Allen, Emily Cifello, Jeffrey Ionita, Matei Wang, Hui Nicaretta, Heather Cheng, Po-Liang Lee, Wan-Ping Wang, Li-San Leung, Yuk Yee |
author_sort | Kuksa, Pavel P |
collection | PubMed |
description | Non-coding genetic variants outside of protein-coding genome regions play an important role in genetic and epigenetic regulation. It has become increasingly important to understand their roles, as non-coding variants often make up the majority of top findings of genome-wide association studies (GWAS). In addition, the growing popularity of disease-specific whole-genome sequencing (WGS) efforts expands the library of and offers unique opportunities for investigating both common and rare non-coding variants, which are typically not detected in more limited GWAS approaches. However, the sheer size and breadth of WGS data introduce additional challenges to predicting functional impacts in terms of data analysis and interpretation. This review focuses on the recent approaches developed for efficient, at-scale annotation and prioritization of non-coding variants uncovered in WGS analyses. In particular, we review the latest scalable annotation tools, databases and functional genomic resources for interpreting the variant findings from WGS based on both experimental data and in silico predictive annotations. We also review machine learning-based predictive models for variant scoring and prioritization. We conclude with a discussion of future research directions which will enhance the data and tools necessary for the effective functional analyses of variants identified by WGS to improve our understanding of disease etiology. |
format | Online Article Text |
id | pubmed-9585666 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-95856662022-10-24 Scalable approaches for functional analyses of whole-genome sequencing non-coding variants Kuksa, Pavel P Greenfest-Allen, Emily Cifello, Jeffrey Ionita, Matei Wang, Hui Nicaretta, Heather Cheng, Po-Liang Lee, Wan-Ping Wang, Li-San Leung, Yuk Yee Hum Mol Genet Review Article Non-coding genetic variants outside of protein-coding genome regions play an important role in genetic and epigenetic regulation. It has become increasingly important to understand their roles, as non-coding variants often make up the majority of top findings of genome-wide association studies (GWAS). In addition, the growing popularity of disease-specific whole-genome sequencing (WGS) efforts expands the library of and offers unique opportunities for investigating both common and rare non-coding variants, which are typically not detected in more limited GWAS approaches. However, the sheer size and breadth of WGS data introduce additional challenges to predicting functional impacts in terms of data analysis and interpretation. This review focuses on the recent approaches developed for efficient, at-scale annotation and prioritization of non-coding variants uncovered in WGS analyses. In particular, we review the latest scalable annotation tools, databases and functional genomic resources for interpreting the variant findings from WGS based on both experimental data and in silico predictive annotations. We also review machine learning-based predictive models for variant scoring and prioritization. We conclude with a discussion of future research directions which will enhance the data and tools necessary for the effective functional analyses of variants identified by WGS to improve our understanding of disease etiology. Oxford University Press 2022-08-09 /pmc/articles/PMC9585666/ /pubmed/35943817 http://dx.doi.org/10.1093/hmg/ddac191 Text en Published by Oxford University Press 2022. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Review Article Kuksa, Pavel P Greenfest-Allen, Emily Cifello, Jeffrey Ionita, Matei Wang, Hui Nicaretta, Heather Cheng, Po-Liang Lee, Wan-Ping Wang, Li-San Leung, Yuk Yee Scalable approaches for functional analyses of whole-genome sequencing non-coding variants |
title | Scalable approaches for functional analyses of whole-genome sequencing non-coding variants |
title_full | Scalable approaches for functional analyses of whole-genome sequencing non-coding variants |
title_fullStr | Scalable approaches for functional analyses of whole-genome sequencing non-coding variants |
title_full_unstemmed | Scalable approaches for functional analyses of whole-genome sequencing non-coding variants |
title_short | Scalable approaches for functional analyses of whole-genome sequencing non-coding variants |
title_sort | scalable approaches for functional analyses of whole-genome sequencing non-coding variants |
topic | Review Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9585666/ https://www.ncbi.nlm.nih.gov/pubmed/35943817 http://dx.doi.org/10.1093/hmg/ddac191 |
work_keys_str_mv | AT kuksapavelp scalableapproachesforfunctionalanalysesofwholegenomesequencingnoncodingvariants AT greenfestallenemily scalableapproachesforfunctionalanalysesofwholegenomesequencingnoncodingvariants AT cifellojeffrey scalableapproachesforfunctionalanalysesofwholegenomesequencingnoncodingvariants AT ionitamatei scalableapproachesforfunctionalanalysesofwholegenomesequencingnoncodingvariants AT wanghui scalableapproachesforfunctionalanalysesofwholegenomesequencingnoncodingvariants AT nicarettaheather scalableapproachesforfunctionalanalysesofwholegenomesequencingnoncodingvariants AT chengpoliang scalableapproachesforfunctionalanalysesofwholegenomesequencingnoncodingvariants AT leewanping scalableapproachesforfunctionalanalysesofwholegenomesequencingnoncodingvariants AT wanglisan scalableapproachesforfunctionalanalysesofwholegenomesequencingnoncodingvariants AT leungyukyee scalableapproachesforfunctionalanalysesofwholegenomesequencingnoncodingvariants |