Cargando…
Improving existing analysis pipeline to identify and analyze cancer driver genes using multi-omics data
The cumulative of genes carrying mutations is vital for the establishment and development of cancer. However, this driver gene exploring research line has selected and used types of tools and models of analysis unsystematically and discretely. Also, the previous studies may have neglected low-freque...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7688645/ https://www.ncbi.nlm.nih.gov/pubmed/33239644 http://dx.doi.org/10.1038/s41598-020-77318-1 |
_version_ | 1783613732446797824 |
---|---|
author | Nguyen, Quang-Huy Le, Duc-Hau |
author_facet | Nguyen, Quang-Huy Le, Duc-Hau |
author_sort | Nguyen, Quang-Huy |
collection | PubMed |
description | The cumulative of genes carrying mutations is vital for the establishment and development of cancer. However, this driver gene exploring research line has selected and used types of tools and models of analysis unsystematically and discretely. Also, the previous studies may have neglected low-frequency drivers and seldom predicted subgroup specificities of identified driver genes. In this study, we presented an improved driver gene identification and analysis pipeline that comprises the four most widely focused analyses for driver genes: enrichment analysis, clinical feature association with expression profiles of identified driver genes as well as with their functional modules, and patient stratification by existing advanced computational tools integrating multi-omics data. The improved pipeline's general usability was demonstrated straightforwardly for breast cancer, validated by some independent databases. Accordingly, 31 validated driver genes, including four novel ones, were discovered. Subsequently, we detected cancer-related significantly enriched gene ontology terms and pathways, probable drug targets, two co-expressed modules associated significantly with several clinical features, such as number of positive lymph nodes, Nottingham prognostic index, and tumor stage, and two biologically distinct groups of BRCA patients. Data and source code of the case study can be downloaded at https://github.com/hauldhut/drivergene. |
format | Online Article Text |
id | pubmed-7688645 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-76886452020-11-27 Improving existing analysis pipeline to identify and analyze cancer driver genes using multi-omics data Nguyen, Quang-Huy Le, Duc-Hau Sci Rep Article The cumulative of genes carrying mutations is vital for the establishment and development of cancer. However, this driver gene exploring research line has selected and used types of tools and models of analysis unsystematically and discretely. Also, the previous studies may have neglected low-frequency drivers and seldom predicted subgroup specificities of identified driver genes. In this study, we presented an improved driver gene identification and analysis pipeline that comprises the four most widely focused analyses for driver genes: enrichment analysis, clinical feature association with expression profiles of identified driver genes as well as with their functional modules, and patient stratification by existing advanced computational tools integrating multi-omics data. The improved pipeline's general usability was demonstrated straightforwardly for breast cancer, validated by some independent databases. Accordingly, 31 validated driver genes, including four novel ones, were discovered. Subsequently, we detected cancer-related significantly enriched gene ontology terms and pathways, probable drug targets, two co-expressed modules associated significantly with several clinical features, such as number of positive lymph nodes, Nottingham prognostic index, and tumor stage, and two biologically distinct groups of BRCA patients. Data and source code of the case study can be downloaded at https://github.com/hauldhut/drivergene. Nature Publishing Group UK 2020-11-25 /pmc/articles/PMC7688645/ /pubmed/33239644 http://dx.doi.org/10.1038/s41598-020-77318-1 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Nguyen, Quang-Huy Le, Duc-Hau Improving existing analysis pipeline to identify and analyze cancer driver genes using multi-omics data |
title | Improving existing analysis pipeline to identify and analyze cancer driver genes using multi-omics data |
title_full | Improving existing analysis pipeline to identify and analyze cancer driver genes using multi-omics data |
title_fullStr | Improving existing analysis pipeline to identify and analyze cancer driver genes using multi-omics data |
title_full_unstemmed | Improving existing analysis pipeline to identify and analyze cancer driver genes using multi-omics data |
title_short | Improving existing analysis pipeline to identify and analyze cancer driver genes using multi-omics data |
title_sort | improving existing analysis pipeline to identify and analyze cancer driver genes using multi-omics data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7688645/ https://www.ncbi.nlm.nih.gov/pubmed/33239644 http://dx.doi.org/10.1038/s41598-020-77318-1 |
work_keys_str_mv | AT nguyenquanghuy improvingexistinganalysispipelinetoidentifyandanalyzecancerdrivergenesusingmultiomicsdata AT leduchau improvingexistinganalysispipelinetoidentifyandanalyzecancerdrivergenesusingmultiomicsdata |