Cargando…

Improving existing analysis pipeline to identify and analyze cancer driver genes using multi-omics data

The cumulative of genes carrying mutations is vital for the establishment and development of cancer. However, this driver gene exploring research line has selected and used types of tools and models of analysis unsystematically and discretely. Also, the previous studies may have neglected low-freque...

Descripción completa

Detalles Bibliográficos
Autores principales: Nguyen, Quang-Huy, Le, Duc-Hau
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7688645/
https://www.ncbi.nlm.nih.gov/pubmed/33239644
http://dx.doi.org/10.1038/s41598-020-77318-1
_version_ 1783613732446797824
author Nguyen, Quang-Huy
Le, Duc-Hau
author_facet Nguyen, Quang-Huy
Le, Duc-Hau
author_sort Nguyen, Quang-Huy
collection PubMed
description The cumulative of genes carrying mutations is vital for the establishment and development of cancer. However, this driver gene exploring research line has selected and used types of tools and models of analysis unsystematically and discretely. Also, the previous studies may have neglected low-frequency drivers and seldom predicted subgroup specificities of identified driver genes. In this study, we presented an improved driver gene identification and analysis pipeline that comprises the four most widely focused analyses for driver genes: enrichment analysis, clinical feature association with expression profiles of identified driver genes as well as with their functional modules, and patient stratification by existing advanced computational tools integrating multi-omics data. The improved pipeline's general usability was demonstrated straightforwardly for breast cancer, validated by some independent databases. Accordingly, 31 validated driver genes, including four novel ones, were discovered. Subsequently, we detected cancer-related significantly enriched gene ontology terms and pathways, probable drug targets, two co-expressed modules associated significantly with several clinical features, such as number of positive lymph nodes, Nottingham prognostic index, and tumor stage, and two biologically distinct groups of BRCA patients. Data and source code of the case study can be downloaded at https://github.com/hauldhut/drivergene.
format Online
Article
Text
id pubmed-7688645
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-76886452020-11-27 Improving existing analysis pipeline to identify and analyze cancer driver genes using multi-omics data Nguyen, Quang-Huy Le, Duc-Hau Sci Rep Article The cumulative of genes carrying mutations is vital for the establishment and development of cancer. However, this driver gene exploring research line has selected and used types of tools and models of analysis unsystematically and discretely. Also, the previous studies may have neglected low-frequency drivers and seldom predicted subgroup specificities of identified driver genes. In this study, we presented an improved driver gene identification and analysis pipeline that comprises the four most widely focused analyses for driver genes: enrichment analysis, clinical feature association with expression profiles of identified driver genes as well as with their functional modules, and patient stratification by existing advanced computational tools integrating multi-omics data. The improved pipeline's general usability was demonstrated straightforwardly for breast cancer, validated by some independent databases. Accordingly, 31 validated driver genes, including four novel ones, were discovered. Subsequently, we detected cancer-related significantly enriched gene ontology terms and pathways, probable drug targets, two co-expressed modules associated significantly with several clinical features, such as number of positive lymph nodes, Nottingham prognostic index, and tumor stage, and two biologically distinct groups of BRCA patients. Data and source code of the case study can be downloaded at https://github.com/hauldhut/drivergene. Nature Publishing Group UK 2020-11-25 /pmc/articles/PMC7688645/ /pubmed/33239644 http://dx.doi.org/10.1038/s41598-020-77318-1 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Nguyen, Quang-Huy
Le, Duc-Hau
Improving existing analysis pipeline to identify and analyze cancer driver genes using multi-omics data
title Improving existing analysis pipeline to identify and analyze cancer driver genes using multi-omics data
title_full Improving existing analysis pipeline to identify and analyze cancer driver genes using multi-omics data
title_fullStr Improving existing analysis pipeline to identify and analyze cancer driver genes using multi-omics data
title_full_unstemmed Improving existing analysis pipeline to identify and analyze cancer driver genes using multi-omics data
title_short Improving existing analysis pipeline to identify and analyze cancer driver genes using multi-omics data
title_sort improving existing analysis pipeline to identify and analyze cancer driver genes using multi-omics data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7688645/
https://www.ncbi.nlm.nih.gov/pubmed/33239644
http://dx.doi.org/10.1038/s41598-020-77318-1
work_keys_str_mv AT nguyenquanghuy improvingexistinganalysispipelinetoidentifyandanalyzecancerdrivergenesusingmultiomicsdata
AT leduchau improvingexistinganalysispipelinetoidentifyandanalyzecancerdrivergenesusingmultiomicsdata