Cargando…

How data science and AI-based technologies impact genomics

Advancements in high-throughput sequencing have yielded vast amounts of genomic data, which are studied using genome-wide association study (GWAS)/phenome-wide association study (PheWAS) methods to identify associations between the genotype and phenotype. The associated findings have contributed to...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Jing, Ngiam, Kee Yuan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Wolters Kluwer - Medknow 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9979798/
https://www.ncbi.nlm.nih.gov/pubmed/36722518
http://dx.doi.org/10.4103/singaporemedj.SMJ-2021-438
_version_ 1784899789906771968
author Lin, Jing
Ngiam, Kee Yuan
author_facet Lin, Jing
Ngiam, Kee Yuan
author_sort Lin, Jing
collection PubMed
description Advancements in high-throughput sequencing have yielded vast amounts of genomic data, which are studied using genome-wide association study (GWAS)/phenome-wide association study (PheWAS) methods to identify associations between the genotype and phenotype. The associated findings have contributed to pharmacogenomics and improved clinical decision support at the point of care in many healthcare systems. However, the accumulation of genomic data from sequencing and clinical data from electronic health records (EHRs) poses significant challenges for data scientists. Following the rise of artificial intelligence (AI) technology such as machine learning and deep learning, an increasing number of GWAS/PheWAS studies have successfully leveraged this technology to overcome the aforementioned challenges. In this review, we focus on the application of data science and AI technology in three areas, including risk prediction and identification of causal single-nucleotide polymorphisms, EHR-based phenotyping and CRISPR guide RNA design. Additionally, we highlight a few emerging AI technologies, such as transfer learning and multi-view learning, which will or have started to benefit genomic studies.
format Online
Article
Text
id pubmed-9979798
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Wolters Kluwer - Medknow
record_format MEDLINE/PubMed
spelling pubmed-99797982023-03-03 How data science and AI-based technologies impact genomics Lin, Jing Ngiam, Kee Yuan Singapore Med J Review Article Advancements in high-throughput sequencing have yielded vast amounts of genomic data, which are studied using genome-wide association study (GWAS)/phenome-wide association study (PheWAS) methods to identify associations between the genotype and phenotype. The associated findings have contributed to pharmacogenomics and improved clinical decision support at the point of care in many healthcare systems. However, the accumulation of genomic data from sequencing and clinical data from electronic health records (EHRs) poses significant challenges for data scientists. Following the rise of artificial intelligence (AI) technology such as machine learning and deep learning, an increasing number of GWAS/PheWAS studies have successfully leveraged this technology to overcome the aforementioned challenges. In this review, we focus on the application of data science and AI technology in three areas, including risk prediction and identification of causal single-nucleotide polymorphisms, EHR-based phenotyping and CRISPR guide RNA design. Additionally, we highlight a few emerging AI technologies, such as transfer learning and multi-view learning, which will or have started to benefit genomic studies. Wolters Kluwer - Medknow 2023-01-19 /pmc/articles/PMC9979798/ /pubmed/36722518 http://dx.doi.org/10.4103/singaporemedj.SMJ-2021-438 Text en Copyright: © 2023 Singapore Medical Journal https://creativecommons.org/licenses/by-nc-sa/4.0/This is an open access journal, and articles are distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 License, which allows others to remix, tweak, and build upon the work non-commercially, as long as appropriate credit is given and the new creations are licensed under the identical terms.
spellingShingle Review Article
Lin, Jing
Ngiam, Kee Yuan
How data science and AI-based technologies impact genomics
title How data science and AI-based technologies impact genomics
title_full How data science and AI-based technologies impact genomics
title_fullStr How data science and AI-based technologies impact genomics
title_full_unstemmed How data science and AI-based technologies impact genomics
title_short How data science and AI-based technologies impact genomics
title_sort how data science and ai-based technologies impact genomics
topic Review Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9979798/
https://www.ncbi.nlm.nih.gov/pubmed/36722518
http://dx.doi.org/10.4103/singaporemedj.SMJ-2021-438
work_keys_str_mv AT linjing howdatascienceandaibasedtechnologiesimpactgenomics
AT ngiamkeeyuan howdatascienceandaibasedtechnologiesimpactgenomics