A data processing pipeline for the AACR project GENIE biopharma collaborative data with the {genieBPC} R package

Bibliographic Details
Main authors: Lavery, Jessica A; Brown, Samantha; Curry, Michael A; Martin, Axel; Sjoberg, Daniel D; Whiting, Karissa
Format: Online Article Text
Language: English
Published: Oxford University Press 2022
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9822536/
https://www.ncbi.nlm.nih.gov/pubmed/36519837
http://dx.doi.org/10.1093/bioinformatics/btac796
Description
Summary: MOTIVATION: Data from the American Association for Cancer Research Project Genomics Evidence Neoplasia Information Exchange Biopharma Collaborative (GENIE BPC) represent comprehensive clinical data linked to high-throughput sequencing data, providing a multi-institution, pan-cancer, publicly available data repository. GENIE BPC data provide detailed demographic, clinical, treatment, genomic and outcome data for patients with cancer. These data result in a unique observational database of molecularly characterized tumors with comprehensive clinical annotation that can be used for health outcomes and precision medicine research in oncology. Due to the inherently complex structure of the multiple phenomic and genomic datasets, the use of these data requires a robust process for data integration and preparation in order to build analytic models. RESULTS: We present the {genieBPC} package, a user-friendly data processing pipeline to facilitate the creation of analytic cohorts from the GENIE BPC data that are ready for clinico-genomic modeling and analyses. AVAILABILITY AND IMPLEMENTATION: {genieBPC} is available on CRAN and GitHub.