Cargando…

TCGA Workflow: Analyze cancer genomics and epigenomics data using Bioconductor packages

Biotechnological advances in sequencing have led to an explosion of publicly available data via large international consortia such as The Cancer Genome Atlas (TCGA), The Encyclopedia of DNA Elements (ENCODE), and The NIH Roadmap Epigenomics Mapping Consortium (Roadmap). These projects have provided...

Descripción completa

Detalles Bibliográficos
Autores principales: Silva, Tiago C., Colaprico, Antonio, Olsen, Catharina, D'Angelo, Fulvio, Bontempi, Gianluca, Ceccarelli, Michele, Noushmehr, Houtan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: F1000Research 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5302158/
https://www.ncbi.nlm.nih.gov/pubmed/28232861
http://dx.doi.org/10.12688/f1000research.8923.2
_version_ 1782506488717115392
author Silva, Tiago C.
Colaprico, Antonio
Olsen, Catharina
D'Angelo, Fulvio
Bontempi, Gianluca
Ceccarelli, Michele
Noushmehr, Houtan
author_facet Silva, Tiago C.
Colaprico, Antonio
Olsen, Catharina
D'Angelo, Fulvio
Bontempi, Gianluca
Ceccarelli, Michele
Noushmehr, Houtan
author_sort Silva, Tiago C.
collection PubMed
description Biotechnological advances in sequencing have led to an explosion of publicly available data via large international consortia such as The Cancer Genome Atlas (TCGA), The Encyclopedia of DNA Elements (ENCODE), and The NIH Roadmap Epigenomics Mapping Consortium (Roadmap). These projects have provided unprecedented opportunities to interrogate the epigenome of cultured cancer cell lines as well as normal and tumor tissues with high genomic resolution. The Bioconductor project offers more than 1,000 open-source software and statistical packages to analyze high-throughput genomic data. However, most packages are designed for specific data types (e.g. expression, epigenetics, genomics) and there is no one comprehensive tool that provides a complete integrative analysis of the resources and data provided by all three public projects. A need to create an integration of these different analyses was recently proposed. In this workflow, we provide a series of biologically focused integrative analyses of different molecular data. We describe how to download, process and prepare TCGA data and by harnessing several key Bioconductor packages, we describe how to extract biologically meaningful genomic and epigenomic data. Using Roadmap and ENCODE data, we provide a work plan to identify biologically relevant functional epigenomic elements associated with cancer. To illustrate our workflow, we analyzed two types of brain tumors: low-grade glioma (LGG) versus high-grade glioma (glioblastoma multiform or GBM). This workflow introduces the following Bioconductor packages: AnnotationHub, ChIPSeeker, ComplexHeatmap, pathview, ELMER, GAIA, MINET, RTCGAToolbox,  TCGAbiolinks.
format Online
Article
Text
id pubmed-5302158
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher F1000Research
record_format MEDLINE/PubMed
spelling pubmed-53021582017-02-22 TCGA Workflow: Analyze cancer genomics and epigenomics data using Bioconductor packages Silva, Tiago C. Colaprico, Antonio Olsen, Catharina D'Angelo, Fulvio Bontempi, Gianluca Ceccarelli, Michele Noushmehr, Houtan F1000Res Software Tool Article Biotechnological advances in sequencing have led to an explosion of publicly available data via large international consortia such as The Cancer Genome Atlas (TCGA), The Encyclopedia of DNA Elements (ENCODE), and The NIH Roadmap Epigenomics Mapping Consortium (Roadmap). These projects have provided unprecedented opportunities to interrogate the epigenome of cultured cancer cell lines as well as normal and tumor tissues with high genomic resolution. The Bioconductor project offers more than 1,000 open-source software and statistical packages to analyze high-throughput genomic data. However, most packages are designed for specific data types (e.g. expression, epigenetics, genomics) and there is no one comprehensive tool that provides a complete integrative analysis of the resources and data provided by all three public projects. A need to create an integration of these different analyses was recently proposed. In this workflow, we provide a series of biologically focused integrative analyses of different molecular data. We describe how to download, process and prepare TCGA data and by harnessing several key Bioconductor packages, we describe how to extract biologically meaningful genomic and epigenomic data. Using Roadmap and ENCODE data, we provide a work plan to identify biologically relevant functional epigenomic elements associated with cancer. To illustrate our workflow, we analyzed two types of brain tumors: low-grade glioma (LGG) versus high-grade glioma (glioblastoma multiform or GBM). This workflow introduces the following Bioconductor packages: AnnotationHub, ChIPSeeker, ComplexHeatmap, pathview, ELMER, GAIA, MINET, RTCGAToolbox,  TCGAbiolinks. F1000Research 2016-12-28 /pmc/articles/PMC5302158/ /pubmed/28232861 http://dx.doi.org/10.12688/f1000research.8923.2 Text en Copyright: © 2016 Silva TC et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software Tool Article
Silva, Tiago C.
Colaprico, Antonio
Olsen, Catharina
D'Angelo, Fulvio
Bontempi, Gianluca
Ceccarelli, Michele
Noushmehr, Houtan
TCGA Workflow: Analyze cancer genomics and epigenomics data using Bioconductor packages
title TCGA Workflow: Analyze cancer genomics and epigenomics data using Bioconductor packages
title_full TCGA Workflow: Analyze cancer genomics and epigenomics data using Bioconductor packages
title_fullStr TCGA Workflow: Analyze cancer genomics and epigenomics data using Bioconductor packages
title_full_unstemmed TCGA Workflow: Analyze cancer genomics and epigenomics data using Bioconductor packages
title_short TCGA Workflow: Analyze cancer genomics and epigenomics data using Bioconductor packages
title_sort tcga workflow: analyze cancer genomics and epigenomics data using bioconductor packages
topic Software Tool Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5302158/
https://www.ncbi.nlm.nih.gov/pubmed/28232861
http://dx.doi.org/10.12688/f1000research.8923.2
work_keys_str_mv AT silvatiagoc tcgaworkflowanalyzecancergenomicsandepigenomicsdatausingbioconductorpackages
AT colapricoantonio tcgaworkflowanalyzecancergenomicsandepigenomicsdatausingbioconductorpackages
AT olsencatharina tcgaworkflowanalyzecancergenomicsandepigenomicsdatausingbioconductorpackages
AT dangelofulvio tcgaworkflowanalyzecancergenomicsandepigenomicsdatausingbioconductorpackages
AT bontempigianluca tcgaworkflowanalyzecancergenomicsandepigenomicsdatausingbioconductorpackages
AT ceccarellimichele tcgaworkflowanalyzecancergenomicsandepigenomicsdatausingbioconductorpackages
AT noushmehrhoutan tcgaworkflowanalyzecancergenomicsandepigenomicsdatausingbioconductorpackages