Cargando…
TCGA Workflow: Analyze cancer genomics and epigenomics data using Bioconductor packages
Biotechnological advances in sequencing have led to an explosion of publicly available data via large international consortia such as The Cancer Genome Atlas (TCGA), The Encyclopedia of DNA Elements (ENCODE), and The NIH Roadmap Epigenomics Mapping Consortium (Roadmap). These projects have provided...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
F1000Research
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5302158/ https://www.ncbi.nlm.nih.gov/pubmed/28232861 http://dx.doi.org/10.12688/f1000research.8923.2 |
_version_ | 1782506488717115392 |
---|---|
author | Silva, Tiago C. Colaprico, Antonio Olsen, Catharina D'Angelo, Fulvio Bontempi, Gianluca Ceccarelli, Michele Noushmehr, Houtan |
author_facet | Silva, Tiago C. Colaprico, Antonio Olsen, Catharina D'Angelo, Fulvio Bontempi, Gianluca Ceccarelli, Michele Noushmehr, Houtan |
author_sort | Silva, Tiago C. |
collection | PubMed |
description | Biotechnological advances in sequencing have led to an explosion of publicly available data via large international consortia such as The Cancer Genome Atlas (TCGA), The Encyclopedia of DNA Elements (ENCODE), and The NIH Roadmap Epigenomics Mapping Consortium (Roadmap). These projects have provided unprecedented opportunities to interrogate the epigenome of cultured cancer cell lines as well as normal and tumor tissues with high genomic resolution. The Bioconductor project offers more than 1,000 open-source software and statistical packages to analyze high-throughput genomic data. However, most packages are designed for specific data types (e.g. expression, epigenetics, genomics) and there is no one comprehensive tool that provides a complete integrative analysis of the resources and data provided by all three public projects. A need to create an integration of these different analyses was recently proposed. In this workflow, we provide a series of biologically focused integrative analyses of different molecular data. We describe how to download, process and prepare TCGA data and by harnessing several key Bioconductor packages, we describe how to extract biologically meaningful genomic and epigenomic data. Using Roadmap and ENCODE data, we provide a work plan to identify biologically relevant functional epigenomic elements associated with cancer. To illustrate our workflow, we analyzed two types of brain tumors: low-grade glioma (LGG) versus high-grade glioma (glioblastoma multiform or GBM). This workflow introduces the following Bioconductor packages: AnnotationHub, ChIPSeeker, ComplexHeatmap, pathview, ELMER, GAIA, MINET, RTCGAToolbox, TCGAbiolinks. |
format | Online Article Text |
id | pubmed-5302158 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | F1000Research |
record_format | MEDLINE/PubMed |
spelling | pubmed-53021582017-02-22 TCGA Workflow: Analyze cancer genomics and epigenomics data using Bioconductor packages Silva, Tiago C. Colaprico, Antonio Olsen, Catharina D'Angelo, Fulvio Bontempi, Gianluca Ceccarelli, Michele Noushmehr, Houtan F1000Res Software Tool Article Biotechnological advances in sequencing have led to an explosion of publicly available data via large international consortia such as The Cancer Genome Atlas (TCGA), The Encyclopedia of DNA Elements (ENCODE), and The NIH Roadmap Epigenomics Mapping Consortium (Roadmap). These projects have provided unprecedented opportunities to interrogate the epigenome of cultured cancer cell lines as well as normal and tumor tissues with high genomic resolution. The Bioconductor project offers more than 1,000 open-source software and statistical packages to analyze high-throughput genomic data. However, most packages are designed for specific data types (e.g. expression, epigenetics, genomics) and there is no one comprehensive tool that provides a complete integrative analysis of the resources and data provided by all three public projects. A need to create an integration of these different analyses was recently proposed. In this workflow, we provide a series of biologically focused integrative analyses of different molecular data. We describe how to download, process and prepare TCGA data and by harnessing several key Bioconductor packages, we describe how to extract biologically meaningful genomic and epigenomic data. Using Roadmap and ENCODE data, we provide a work plan to identify biologically relevant functional epigenomic elements associated with cancer. To illustrate our workflow, we analyzed two types of brain tumors: low-grade glioma (LGG) versus high-grade glioma (glioblastoma multiform or GBM). This workflow introduces the following Bioconductor packages: AnnotationHub, ChIPSeeker, ComplexHeatmap, pathview, ELMER, GAIA, MINET, RTCGAToolbox, TCGAbiolinks. F1000Research 2016-12-28 /pmc/articles/PMC5302158/ /pubmed/28232861 http://dx.doi.org/10.12688/f1000research.8923.2 Text en Copyright: © 2016 Silva TC et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Software Tool Article Silva, Tiago C. Colaprico, Antonio Olsen, Catharina D'Angelo, Fulvio Bontempi, Gianluca Ceccarelli, Michele Noushmehr, Houtan TCGA Workflow: Analyze cancer genomics and epigenomics data using Bioconductor packages |
title |
TCGA Workflow: Analyze cancer genomics and epigenomics data using Bioconductor packages |
title_full |
TCGA Workflow: Analyze cancer genomics and epigenomics data using Bioconductor packages |
title_fullStr |
TCGA Workflow: Analyze cancer genomics and epigenomics data using Bioconductor packages |
title_full_unstemmed |
TCGA Workflow: Analyze cancer genomics and epigenomics data using Bioconductor packages |
title_short |
TCGA Workflow: Analyze cancer genomics and epigenomics data using Bioconductor packages |
title_sort | tcga workflow: analyze cancer genomics and epigenomics data using bioconductor packages |
topic | Software Tool Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5302158/ https://www.ncbi.nlm.nih.gov/pubmed/28232861 http://dx.doi.org/10.12688/f1000research.8923.2 |
work_keys_str_mv | AT silvatiagoc tcgaworkflowanalyzecancergenomicsandepigenomicsdatausingbioconductorpackages AT colapricoantonio tcgaworkflowanalyzecancergenomicsandepigenomicsdatausingbioconductorpackages AT olsencatharina tcgaworkflowanalyzecancergenomicsandepigenomicsdatausingbioconductorpackages AT dangelofulvio tcgaworkflowanalyzecancergenomicsandepigenomicsdatausingbioconductorpackages AT bontempigianluca tcgaworkflowanalyzecancergenomicsandepigenomicsdatausingbioconductorpackages AT ceccarellimichele tcgaworkflowanalyzecancergenomicsandepigenomicsdatausingbioconductorpackages AT noushmehrhoutan tcgaworkflowanalyzecancergenomicsandepigenomicsdatausingbioconductorpackages |