Cargando…

Genomics and data science: an application within an umbrella

Data science allows the extraction of practical insights from large-scale data. Here, we contextualize it as an umbrella term, encompassing several disparate subdomains. We focus on how genomics fits as a specific application subdomain, in terms of well-known 3 V data and 4 M process frameworks (vol...

Descripción completa

Detalles Bibliográficos
Autores principales: Navarro, Fábio C. P., Mohsen, Hussein, Yan, Chengfei, Li, Shantao, Gu, Mengting, Meyerson, William, Gerstein, Mark
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6540394/
https://www.ncbi.nlm.nih.gov/pubmed/31142351
http://dx.doi.org/10.1186/s13059-019-1724-1
_version_ 1783422606823653376
author Navarro, Fábio C. P.
Mohsen, Hussein
Yan, Chengfei
Li, Shantao
Gu, Mengting
Meyerson, William
Gerstein, Mark
author_facet Navarro, Fábio C. P.
Mohsen, Hussein
Yan, Chengfei
Li, Shantao
Gu, Mengting
Meyerson, William
Gerstein, Mark
author_sort Navarro, Fábio C. P.
collection PubMed
description Data science allows the extraction of practical insights from large-scale data. Here, we contextualize it as an umbrella term, encompassing several disparate subdomains. We focus on how genomics fits as a specific application subdomain, in terms of well-known 3 V data and 4 M process frameworks (volume-velocity-variety and measurement-mining-modeling-manipulation, respectively). We further analyze the technical and cultural “exports” and “imports” between genomics and other data-science subdomains (e.g., astronomy). Finally, we discuss how data value, privacy, and ownership are pressing issues for data science applications, in general, and are especially relevant to genomics, due to the persistent nature of DNA.
format Online
Article
Text
id pubmed-6540394
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-65403942019-06-03 Genomics and data science: an application within an umbrella Navarro, Fábio C. P. Mohsen, Hussein Yan, Chengfei Li, Shantao Gu, Mengting Meyerson, William Gerstein, Mark Genome Biol Opinion Data science allows the extraction of practical insights from large-scale data. Here, we contextualize it as an umbrella term, encompassing several disparate subdomains. We focus on how genomics fits as a specific application subdomain, in terms of well-known 3 V data and 4 M process frameworks (volume-velocity-variety and measurement-mining-modeling-manipulation, respectively). We further analyze the technical and cultural “exports” and “imports” between genomics and other data-science subdomains (e.g., astronomy). Finally, we discuss how data value, privacy, and ownership are pressing issues for data science applications, in general, and are especially relevant to genomics, due to the persistent nature of DNA. BioMed Central 2019-05-29 /pmc/articles/PMC6540394/ /pubmed/31142351 http://dx.doi.org/10.1186/s13059-019-1724-1 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Opinion
Navarro, Fábio C. P.
Mohsen, Hussein
Yan, Chengfei
Li, Shantao
Gu, Mengting
Meyerson, William
Gerstein, Mark
Genomics and data science: an application within an umbrella
title Genomics and data science: an application within an umbrella
title_full Genomics and data science: an application within an umbrella
title_fullStr Genomics and data science: an application within an umbrella
title_full_unstemmed Genomics and data science: an application within an umbrella
title_short Genomics and data science: an application within an umbrella
title_sort genomics and data science: an application within an umbrella
topic Opinion
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6540394/
https://www.ncbi.nlm.nih.gov/pubmed/31142351
http://dx.doi.org/10.1186/s13059-019-1724-1
work_keys_str_mv AT navarrofabiocp genomicsanddatascienceanapplicationwithinanumbrella
AT mohsenhussein genomicsanddatascienceanapplicationwithinanumbrella
AT yanchengfei genomicsanddatascienceanapplicationwithinanumbrella
AT lishantao genomicsanddatascienceanapplicationwithinanumbrella
AT gumengting genomicsanddatascienceanapplicationwithinanumbrella
AT meyersonwilliam genomicsanddatascienceanapplicationwithinanumbrella
AT gersteinmark genomicsanddatascienceanapplicationwithinanumbrella