Cargando…

Scalable approaches for functional analyses of whole-genome sequencing non-coding variants

Non-coding genetic variants outside of protein-coding genome regions play an important role in genetic and epigenetic regulation. It has become increasingly important to understand their roles, as non-coding variants often make up the majority of top findings of genome-wide association studies (GWAS...

Descripción completa

Detalles Bibliográficos
Autores principales: Kuksa, Pavel P, Greenfest-Allen, Emily, Cifello, Jeffrey, Ionita, Matei, Wang, Hui, Nicaretta, Heather, Cheng, Po-Liang, Lee, Wan-Ping, Wang, Li-San, Leung, Yuk Yee
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9585666/
https://www.ncbi.nlm.nih.gov/pubmed/35943817
http://dx.doi.org/10.1093/hmg/ddac191
_version_ 1784813542588809216
author Kuksa, Pavel P
Greenfest-Allen, Emily
Cifello, Jeffrey
Ionita, Matei
Wang, Hui
Nicaretta, Heather
Cheng, Po-Liang
Lee, Wan-Ping
Wang, Li-San
Leung, Yuk Yee
author_facet Kuksa, Pavel P
Greenfest-Allen, Emily
Cifello, Jeffrey
Ionita, Matei
Wang, Hui
Nicaretta, Heather
Cheng, Po-Liang
Lee, Wan-Ping
Wang, Li-San
Leung, Yuk Yee
author_sort Kuksa, Pavel P
collection PubMed
description Non-coding genetic variants outside of protein-coding genome regions play an important role in genetic and epigenetic regulation. It has become increasingly important to understand their roles, as non-coding variants often make up the majority of top findings of genome-wide association studies (GWAS). In addition, the growing popularity of disease-specific whole-genome sequencing (WGS) efforts expands the library of and offers unique opportunities for investigating both common and rare non-coding variants, which are typically not detected in more limited GWAS approaches. However, the sheer size and breadth of WGS data introduce additional challenges to predicting functional impacts in terms of data analysis and interpretation. This review focuses on the recent approaches developed for efficient, at-scale annotation and prioritization of non-coding variants uncovered in WGS analyses. In particular, we review the latest scalable annotation tools, databases and functional genomic resources for interpreting the variant findings from WGS based on both experimental data and in silico predictive annotations. We also review machine learning-based predictive models for variant scoring and prioritization. We conclude with a discussion of future research directions which will enhance the data and tools necessary for the effective functional analyses of variants identified by WGS to improve our understanding of disease etiology.
format Online
Article
Text
id pubmed-9585666
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-95856662022-10-24 Scalable approaches for functional analyses of whole-genome sequencing non-coding variants Kuksa, Pavel P Greenfest-Allen, Emily Cifello, Jeffrey Ionita, Matei Wang, Hui Nicaretta, Heather Cheng, Po-Liang Lee, Wan-Ping Wang, Li-San Leung, Yuk Yee Hum Mol Genet Review Article Non-coding genetic variants outside of protein-coding genome regions play an important role in genetic and epigenetic regulation. It has become increasingly important to understand their roles, as non-coding variants often make up the majority of top findings of genome-wide association studies (GWAS). In addition, the growing popularity of disease-specific whole-genome sequencing (WGS) efforts expands the library of and offers unique opportunities for investigating both common and rare non-coding variants, which are typically not detected in more limited GWAS approaches. However, the sheer size and breadth of WGS data introduce additional challenges to predicting functional impacts in terms of data analysis and interpretation. This review focuses on the recent approaches developed for efficient, at-scale annotation and prioritization of non-coding variants uncovered in WGS analyses. In particular, we review the latest scalable annotation tools, databases and functional genomic resources for interpreting the variant findings from WGS based on both experimental data and in silico predictive annotations. We also review machine learning-based predictive models for variant scoring and prioritization. We conclude with a discussion of future research directions which will enhance the data and tools necessary for the effective functional analyses of variants identified by WGS to improve our understanding of disease etiology. Oxford University Press 2022-08-09 /pmc/articles/PMC9585666/ /pubmed/35943817 http://dx.doi.org/10.1093/hmg/ddac191 Text en Published by Oxford University Press 2022. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Review Article
Kuksa, Pavel P
Greenfest-Allen, Emily
Cifello, Jeffrey
Ionita, Matei
Wang, Hui
Nicaretta, Heather
Cheng, Po-Liang
Lee, Wan-Ping
Wang, Li-San
Leung, Yuk Yee
Scalable approaches for functional analyses of whole-genome sequencing non-coding variants
title Scalable approaches for functional analyses of whole-genome sequencing non-coding variants
title_full Scalable approaches for functional analyses of whole-genome sequencing non-coding variants
title_fullStr Scalable approaches for functional analyses of whole-genome sequencing non-coding variants
title_full_unstemmed Scalable approaches for functional analyses of whole-genome sequencing non-coding variants
title_short Scalable approaches for functional analyses of whole-genome sequencing non-coding variants
title_sort scalable approaches for functional analyses of whole-genome sequencing non-coding variants
topic Review Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9585666/
https://www.ncbi.nlm.nih.gov/pubmed/35943817
http://dx.doi.org/10.1093/hmg/ddac191
work_keys_str_mv AT kuksapavelp scalableapproachesforfunctionalanalysesofwholegenomesequencingnoncodingvariants
AT greenfestallenemily scalableapproachesforfunctionalanalysesofwholegenomesequencingnoncodingvariants
AT cifellojeffrey scalableapproachesforfunctionalanalysesofwholegenomesequencingnoncodingvariants
AT ionitamatei scalableapproachesforfunctionalanalysesofwholegenomesequencingnoncodingvariants
AT wanghui scalableapproachesforfunctionalanalysesofwholegenomesequencingnoncodingvariants
AT nicarettaheather scalableapproachesforfunctionalanalysesofwholegenomesequencingnoncodingvariants
AT chengpoliang scalableapproachesforfunctionalanalysesofwholegenomesequencingnoncodingvariants
AT leewanping scalableapproachesforfunctionalanalysesofwholegenomesequencingnoncodingvariants
AT wanglisan scalableapproachesforfunctionalanalysesofwholegenomesequencingnoncodingvariants
AT leungyukyee scalableapproachesforfunctionalanalysesofwholegenomesequencingnoncodingvariants