Cargando…
TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data
The Cancer Genome Atlas (TCGA) research network has made public a large collection of clinical and molecular phenotypes of more than 10 000 tumor patients across 33 different tumor types. Using this cohort, TCGA has published over 20 marker papers detailing the genomic and epigenomic alterations ass...
Autores principales: | , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4856967/ https://www.ncbi.nlm.nih.gov/pubmed/26704973 http://dx.doi.org/10.1093/nar/gkv1507 |
_version_ | 1782430573370802176 |
---|---|
author | Colaprico, Antonio Silva, Tiago C. Olsen, Catharina Garofano, Luciano Cava, Claudia Garolini, Davide Sabedot, Thais S. Malta, Tathiane M. Pagnotta, Stefano M. Castiglioni, Isabella Ceccarelli, Michele Bontempi, Gianluca Noushmehr, Houtan |
author_facet | Colaprico, Antonio Silva, Tiago C. Olsen, Catharina Garofano, Luciano Cava, Claudia Garolini, Davide Sabedot, Thais S. Malta, Tathiane M. Pagnotta, Stefano M. Castiglioni, Isabella Ceccarelli, Michele Bontempi, Gianluca Noushmehr, Houtan |
author_sort | Colaprico, Antonio |
collection | PubMed |
description | The Cancer Genome Atlas (TCGA) research network has made public a large collection of clinical and molecular phenotypes of more than 10 000 tumor patients across 33 different tumor types. Using this cohort, TCGA has published over 20 marker papers detailing the genomic and epigenomic alterations associated with these tumor types. Although many important discoveries have been made by TCGA's research network, opportunities still exist to implement novel methods, thereby elucidating new biological pathways and diagnostic markers. However, mining the TCGA data presents several bioinformatics challenges, such as data retrieval and integration with clinical data and other molecular data types (e.g. RNA and DNA methylation). We developed an R/Bioconductor package called TCGAbiolinks to address these challenges and offer bioinformatics solutions by using a guided workflow to allow users to query, download and perform integrative analyses of TCGA data. We combined methods from computer science and statistics into the pipeline and incorporated methodologies developed in previous TCGA marker studies and in our own group. Using four different TCGA tumor types (Kidney, Brain, Breast and Colon) as examples, we provide case studies to illustrate examples of reproducibility, integrative analysis and utilization of different Bioconductor packages to advance and accelerate novel discoveries. |
format | Online Article Text |
id | pubmed-4856967 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-48569672016-05-09 TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data Colaprico, Antonio Silva, Tiago C. Olsen, Catharina Garofano, Luciano Cava, Claudia Garolini, Davide Sabedot, Thais S. Malta, Tathiane M. Pagnotta, Stefano M. Castiglioni, Isabella Ceccarelli, Michele Bontempi, Gianluca Noushmehr, Houtan Nucleic Acids Res Methods Online The Cancer Genome Atlas (TCGA) research network has made public a large collection of clinical and molecular phenotypes of more than 10 000 tumor patients across 33 different tumor types. Using this cohort, TCGA has published over 20 marker papers detailing the genomic and epigenomic alterations associated with these tumor types. Although many important discoveries have been made by TCGA's research network, opportunities still exist to implement novel methods, thereby elucidating new biological pathways and diagnostic markers. However, mining the TCGA data presents several bioinformatics challenges, such as data retrieval and integration with clinical data and other molecular data types (e.g. RNA and DNA methylation). We developed an R/Bioconductor package called TCGAbiolinks to address these challenges and offer bioinformatics solutions by using a guided workflow to allow users to query, download and perform integrative analyses of TCGA data. We combined methods from computer science and statistics into the pipeline and incorporated methodologies developed in previous TCGA marker studies and in our own group. Using four different TCGA tumor types (Kidney, Brain, Breast and Colon) as examples, we provide case studies to illustrate examples of reproducibility, integrative analysis and utilization of different Bioconductor packages to advance and accelerate novel discoveries. Oxford University Press 2016-05-05 2015-12-23 /pmc/articles/PMC4856967/ /pubmed/26704973 http://dx.doi.org/10.1093/nar/gkv1507 Text en © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Methods Online Colaprico, Antonio Silva, Tiago C. Olsen, Catharina Garofano, Luciano Cava, Claudia Garolini, Davide Sabedot, Thais S. Malta, Tathiane M. Pagnotta, Stefano M. Castiglioni, Isabella Ceccarelli, Michele Bontempi, Gianluca Noushmehr, Houtan TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data |
title | TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data |
title_full | TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data |
title_fullStr | TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data |
title_full_unstemmed | TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data |
title_short | TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data |
title_sort | tcgabiolinks: an r/bioconductor package for integrative analysis of tcga data |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4856967/ https://www.ncbi.nlm.nih.gov/pubmed/26704973 http://dx.doi.org/10.1093/nar/gkv1507 |
work_keys_str_mv | AT colapricoantonio tcgabiolinksanrbioconductorpackageforintegrativeanalysisoftcgadata AT silvatiagoc tcgabiolinksanrbioconductorpackageforintegrativeanalysisoftcgadata AT olsencatharina tcgabiolinksanrbioconductorpackageforintegrativeanalysisoftcgadata AT garofanoluciano tcgabiolinksanrbioconductorpackageforintegrativeanalysisoftcgadata AT cavaclaudia tcgabiolinksanrbioconductorpackageforintegrativeanalysisoftcgadata AT garolinidavide tcgabiolinksanrbioconductorpackageforintegrativeanalysisoftcgadata AT sabedotthaiss tcgabiolinksanrbioconductorpackageforintegrativeanalysisoftcgadata AT maltatathianem tcgabiolinksanrbioconductorpackageforintegrativeanalysisoftcgadata AT pagnottastefanom tcgabiolinksanrbioconductorpackageforintegrativeanalysisoftcgadata AT castiglioniisabella tcgabiolinksanrbioconductorpackageforintegrativeanalysisoftcgadata AT ceccarellimichele tcgabiolinksanrbioconductorpackageforintegrativeanalysisoftcgadata AT bontempigianluca tcgabiolinksanrbioconductorpackageforintegrativeanalysisoftcgadata AT noushmehrhoutan tcgabiolinksanrbioconductorpackageforintegrativeanalysisoftcgadata |