Cargando…

TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data

The Cancer Genome Atlas (TCGA) research network has made public a large collection of clinical and molecular phenotypes of more than 10 000 tumor patients across 33 different tumor types. Using this cohort, TCGA has published over 20 marker papers detailing the genomic and epigenomic alterations ass...

Descripción completa

Detalles Bibliográficos
Autores principales: Colaprico, Antonio, Silva, Tiago C., Olsen, Catharina, Garofano, Luciano, Cava, Claudia, Garolini, Davide, Sabedot, Thais S., Malta, Tathiane M., Pagnotta, Stefano M., Castiglioni, Isabella, Ceccarelli, Michele, Bontempi, Gianluca, Noushmehr, Houtan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4856967/
https://www.ncbi.nlm.nih.gov/pubmed/26704973
http://dx.doi.org/10.1093/nar/gkv1507
_version_ 1782430573370802176
author Colaprico, Antonio
Silva, Tiago C.
Olsen, Catharina
Garofano, Luciano
Cava, Claudia
Garolini, Davide
Sabedot, Thais S.
Malta, Tathiane M.
Pagnotta, Stefano M.
Castiglioni, Isabella
Ceccarelli, Michele
Bontempi, Gianluca
Noushmehr, Houtan
author_facet Colaprico, Antonio
Silva, Tiago C.
Olsen, Catharina
Garofano, Luciano
Cava, Claudia
Garolini, Davide
Sabedot, Thais S.
Malta, Tathiane M.
Pagnotta, Stefano M.
Castiglioni, Isabella
Ceccarelli, Michele
Bontempi, Gianluca
Noushmehr, Houtan
author_sort Colaprico, Antonio
collection PubMed
description The Cancer Genome Atlas (TCGA) research network has made public a large collection of clinical and molecular phenotypes of more than 10 000 tumor patients across 33 different tumor types. Using this cohort, TCGA has published over 20 marker papers detailing the genomic and epigenomic alterations associated with these tumor types. Although many important discoveries have been made by TCGA's research network, opportunities still exist to implement novel methods, thereby elucidating new biological pathways and diagnostic markers. However, mining the TCGA data presents several bioinformatics challenges, such as data retrieval and integration with clinical data and other molecular data types (e.g. RNA and DNA methylation). We developed an R/Bioconductor package called TCGAbiolinks to address these challenges and offer bioinformatics solutions by using a guided workflow to allow users to query, download and perform integrative analyses of TCGA data. We combined methods from computer science and statistics into the pipeline and incorporated methodologies developed in previous TCGA marker studies and in our own group. Using four different TCGA tumor types (Kidney, Brain, Breast and Colon) as examples, we provide case studies to illustrate examples of reproducibility, integrative analysis and utilization of different Bioconductor packages to advance and accelerate novel discoveries.
format Online
Article
Text
id pubmed-4856967
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-48569672016-05-09 TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data Colaprico, Antonio Silva, Tiago C. Olsen, Catharina Garofano, Luciano Cava, Claudia Garolini, Davide Sabedot, Thais S. Malta, Tathiane M. Pagnotta, Stefano M. Castiglioni, Isabella Ceccarelli, Michele Bontempi, Gianluca Noushmehr, Houtan Nucleic Acids Res Methods Online The Cancer Genome Atlas (TCGA) research network has made public a large collection of clinical and molecular phenotypes of more than 10 000 tumor patients across 33 different tumor types. Using this cohort, TCGA has published over 20 marker papers detailing the genomic and epigenomic alterations associated with these tumor types. Although many important discoveries have been made by TCGA's research network, opportunities still exist to implement novel methods, thereby elucidating new biological pathways and diagnostic markers. However, mining the TCGA data presents several bioinformatics challenges, such as data retrieval and integration with clinical data and other molecular data types (e.g. RNA and DNA methylation). We developed an R/Bioconductor package called TCGAbiolinks to address these challenges and offer bioinformatics solutions by using a guided workflow to allow users to query, download and perform integrative analyses of TCGA data. We combined methods from computer science and statistics into the pipeline and incorporated methodologies developed in previous TCGA marker studies and in our own group. Using four different TCGA tumor types (Kidney, Brain, Breast and Colon) as examples, we provide case studies to illustrate examples of reproducibility, integrative analysis and utilization of different Bioconductor packages to advance and accelerate novel discoveries. Oxford University Press 2016-05-05 2015-12-23 /pmc/articles/PMC4856967/ /pubmed/26704973 http://dx.doi.org/10.1093/nar/gkv1507 Text en © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Online
Colaprico, Antonio
Silva, Tiago C.
Olsen, Catharina
Garofano, Luciano
Cava, Claudia
Garolini, Davide
Sabedot, Thais S.
Malta, Tathiane M.
Pagnotta, Stefano M.
Castiglioni, Isabella
Ceccarelli, Michele
Bontempi, Gianluca
Noushmehr, Houtan
TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data
title TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data
title_full TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data
title_fullStr TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data
title_full_unstemmed TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data
title_short TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data
title_sort tcgabiolinks: an r/bioconductor package for integrative analysis of tcga data
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4856967/
https://www.ncbi.nlm.nih.gov/pubmed/26704973
http://dx.doi.org/10.1093/nar/gkv1507
work_keys_str_mv AT colapricoantonio tcgabiolinksanrbioconductorpackageforintegrativeanalysisoftcgadata
AT silvatiagoc tcgabiolinksanrbioconductorpackageforintegrativeanalysisoftcgadata
AT olsencatharina tcgabiolinksanrbioconductorpackageforintegrativeanalysisoftcgadata
AT garofanoluciano tcgabiolinksanrbioconductorpackageforintegrativeanalysisoftcgadata
AT cavaclaudia tcgabiolinksanrbioconductorpackageforintegrativeanalysisoftcgadata
AT garolinidavide tcgabiolinksanrbioconductorpackageforintegrativeanalysisoftcgadata
AT sabedotthaiss tcgabiolinksanrbioconductorpackageforintegrativeanalysisoftcgadata
AT maltatathianem tcgabiolinksanrbioconductorpackageforintegrativeanalysisoftcgadata
AT pagnottastefanom tcgabiolinksanrbioconductorpackageforintegrativeanalysisoftcgadata
AT castiglioniisabella tcgabiolinksanrbioconductorpackageforintegrativeanalysisoftcgadata
AT ceccarellimichele tcgabiolinksanrbioconductorpackageforintegrativeanalysisoftcgadata
AT bontempigianluca tcgabiolinksanrbioconductorpackageforintegrativeanalysisoftcgadata
AT noushmehrhoutan tcgabiolinksanrbioconductorpackageforintegrativeanalysisoftcgadata