Cargando…

A Comprehensive Infrastructure for Big Data in Cancer Research: Accelerating Cancer Research and Precision Medicine

Advancements in next-generation sequencing and other -omics technologies are accelerating the detailed molecular characterization of individual patient tumors, and driving the evolution of precision medicine. Cancer is no longer considered a single disease, but rather, a diverse array of diseases wh...

Descripción completa

Detalles Bibliográficos
Autores principales: Hinkson, Izumi V., Davidsen, Tanja M., Klemm, Juli D., Chandramouliswaran, Ishwar, Kerlavage, Anthony R., Kibbe, Warren A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5613113/
https://www.ncbi.nlm.nih.gov/pubmed/28983483
http://dx.doi.org/10.3389/fcell.2017.00083
_version_ 1783266186672209920
author Hinkson, Izumi V.
Davidsen, Tanja M.
Klemm, Juli D.
Chandramouliswaran, Ishwar
Kerlavage, Anthony R.
Kibbe, Warren A.
author_facet Hinkson, Izumi V.
Davidsen, Tanja M.
Klemm, Juli D.
Chandramouliswaran, Ishwar
Kerlavage, Anthony R.
Kibbe, Warren A.
author_sort Hinkson, Izumi V.
collection PubMed
description Advancements in next-generation sequencing and other -omics technologies are accelerating the detailed molecular characterization of individual patient tumors, and driving the evolution of precision medicine. Cancer is no longer considered a single disease, but rather, a diverse array of diseases wherein each patient has a unique collection of germline variants and somatic mutations. Molecular profiling of patient-derived samples has led to a data explosion that could help us understand the contributions of environment and germline to risk, therapeutic response, and outcome. To maximize the value of these data, an interdisciplinary approach is paramount. The National Cancer Institute (NCI) has initiated multiple projects to characterize tumor samples using multi-omic approaches. These projects harness the expertise of clinicians, biologists, computer scientists, and software engineers to investigate cancer biology and therapeutic response in multidisciplinary teams. Petabytes of cancer genomic, transcriptomic, epigenomic, proteomic, and imaging data have been generated by these projects. To address the data analysis challenges associated with these large datasets, the NCI has sponsored the development of the Genomic Data Commons (GDC) and three Cloud Resources. The GDC ensures data and metadata quality, ingests and harmonizes genomic data, and securely redistributes the data. During its pilot phase, the Cloud Resources tested multiple cloud-based approaches for enhancing data access, collaboration, computational scalability, resource democratization, and reproducibility. These NCI-led efforts are continuously being refined to better support open data practices and precision oncology, and to serve as building blocks of the NCI Cancer Research Data Commons.
format Online
Article
Text
id pubmed-5613113
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-56131132017-10-05 A Comprehensive Infrastructure for Big Data in Cancer Research: Accelerating Cancer Research and Precision Medicine Hinkson, Izumi V. Davidsen, Tanja M. Klemm, Juli D. Chandramouliswaran, Ishwar Kerlavage, Anthony R. Kibbe, Warren A. Front Cell Dev Biol Cell and Developmental Biology Advancements in next-generation sequencing and other -omics technologies are accelerating the detailed molecular characterization of individual patient tumors, and driving the evolution of precision medicine. Cancer is no longer considered a single disease, but rather, a diverse array of diseases wherein each patient has a unique collection of germline variants and somatic mutations. Molecular profiling of patient-derived samples has led to a data explosion that could help us understand the contributions of environment and germline to risk, therapeutic response, and outcome. To maximize the value of these data, an interdisciplinary approach is paramount. The National Cancer Institute (NCI) has initiated multiple projects to characterize tumor samples using multi-omic approaches. These projects harness the expertise of clinicians, biologists, computer scientists, and software engineers to investigate cancer biology and therapeutic response in multidisciplinary teams. Petabytes of cancer genomic, transcriptomic, epigenomic, proteomic, and imaging data have been generated by these projects. To address the data analysis challenges associated with these large datasets, the NCI has sponsored the development of the Genomic Data Commons (GDC) and three Cloud Resources. The GDC ensures data and metadata quality, ingests and harmonizes genomic data, and securely redistributes the data. During its pilot phase, the Cloud Resources tested multiple cloud-based approaches for enhancing data access, collaboration, computational scalability, resource democratization, and reproducibility. These NCI-led efforts are continuously being refined to better support open data practices and precision oncology, and to serve as building blocks of the NCI Cancer Research Data Commons. Frontiers Media S.A. 2017-09-21 /pmc/articles/PMC5613113/ /pubmed/28983483 http://dx.doi.org/10.3389/fcell.2017.00083 Text en Copyright © 2017 Hinkson, Davidsen, Klemm, Chandramouliswaran, Kerlavage and Kibbe. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Cell and Developmental Biology
Hinkson, Izumi V.
Davidsen, Tanja M.
Klemm, Juli D.
Chandramouliswaran, Ishwar
Kerlavage, Anthony R.
Kibbe, Warren A.
A Comprehensive Infrastructure for Big Data in Cancer Research: Accelerating Cancer Research and Precision Medicine
title A Comprehensive Infrastructure for Big Data in Cancer Research: Accelerating Cancer Research and Precision Medicine
title_full A Comprehensive Infrastructure for Big Data in Cancer Research: Accelerating Cancer Research and Precision Medicine
title_fullStr A Comprehensive Infrastructure for Big Data in Cancer Research: Accelerating Cancer Research and Precision Medicine
title_full_unstemmed A Comprehensive Infrastructure for Big Data in Cancer Research: Accelerating Cancer Research and Precision Medicine
title_short A Comprehensive Infrastructure for Big Data in Cancer Research: Accelerating Cancer Research and Precision Medicine
title_sort comprehensive infrastructure for big data in cancer research: accelerating cancer research and precision medicine
topic Cell and Developmental Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5613113/
https://www.ncbi.nlm.nih.gov/pubmed/28983483
http://dx.doi.org/10.3389/fcell.2017.00083
work_keys_str_mv AT hinksonizumiv acomprehensiveinfrastructureforbigdataincancerresearchacceleratingcancerresearchandprecisionmedicine
AT davidsentanjam acomprehensiveinfrastructureforbigdataincancerresearchacceleratingcancerresearchandprecisionmedicine
AT klemmjulid acomprehensiveinfrastructureforbigdataincancerresearchacceleratingcancerresearchandprecisionmedicine
AT chandramouliswaranishwar acomprehensiveinfrastructureforbigdataincancerresearchacceleratingcancerresearchandprecisionmedicine
AT kerlavageanthonyr acomprehensiveinfrastructureforbigdataincancerresearchacceleratingcancerresearchandprecisionmedicine
AT kibbewarrena acomprehensiveinfrastructureforbigdataincancerresearchacceleratingcancerresearchandprecisionmedicine
AT hinksonizumiv comprehensiveinfrastructureforbigdataincancerresearchacceleratingcancerresearchandprecisionmedicine
AT davidsentanjam comprehensiveinfrastructureforbigdataincancerresearchacceleratingcancerresearchandprecisionmedicine
AT klemmjulid comprehensiveinfrastructureforbigdataincancerresearchacceleratingcancerresearchandprecisionmedicine
AT chandramouliswaranishwar comprehensiveinfrastructureforbigdataincancerresearchacceleratingcancerresearchandprecisionmedicine
AT kerlavageanthonyr comprehensiveinfrastructureforbigdataincancerresearchacceleratingcancerresearchandprecisionmedicine
AT kibbewarrena comprehensiveinfrastructureforbigdataincancerresearchacceleratingcancerresearchandprecisionmedicine