Cargando…
Multiple Criteria Optimization (MCO): A gene selection deterministic tool in RStudio
Identifying genes with the largest expression changes (gene selection) to characterize a given condition is a popular first step to drive exploration into molecular mechanisms and is, therefore, paramount for therapeutic development. Reproducibility in the sciences makes it necessary to emphasize ob...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8794188/ https://www.ncbi.nlm.nih.gov/pubmed/35085348 http://dx.doi.org/10.1371/journal.pone.0262890 |
_version_ | 1784640777239920640 |
---|---|
author | Narváez-Bandera, Isis Suárez-Gómez, Deiver Isaza, Clara E. Cabrera-Ríos, Mauricio |
author_facet | Narváez-Bandera, Isis Suárez-Gómez, Deiver Isaza, Clara E. Cabrera-Ríos, Mauricio |
author_sort | Narváez-Bandera, Isis |
collection | PubMed |
description | Identifying genes with the largest expression changes (gene selection) to characterize a given condition is a popular first step to drive exploration into molecular mechanisms and is, therefore, paramount for therapeutic development. Reproducibility in the sciences makes it necessary to emphasize objectivity and systematic repeatability in biological and informatics analyses, including gene selection. With these two characteristics in mind, in previous works our research team has proposed using multiple criteria optimization (MCO) in gene selection to analyze microarray datasets. The result of this effort is the MCO algorithm, which selects genes with the largest expression changes without user manipulation of neither informatics nor statistical parameters. Furthermore, the user is not required to choose either a preference structure among multiple measures or a predetermined quantity of genes to be deemed significant a priori. This implies that using the same datasets and performance measures (PMs), the method will converge to the same set of selected differentially expressed genes (repeatability) despite who carries out the analysis (objectivity). The present work describes the development of an open-source tool in RStudio to enable both: (1) individual analysis of single datasets with two or three PMs and (2) meta-analysis with up to five microarray datasets, using one PM from each dataset. The capabilities afforded by the code include license-free portability and the possibility to carry out analyses via modest computer hardware, such as personal laptops. The code provides affordable, repeatable, and objective detection of differentially expressed genes from microarrays. It can be used to analyze other experiments with similar experimental comparative layouts, such as microRNA arrays and protein arrays, among others. As a demonstration of the capabilities of the code, the analysis of four publicly-available microarray datasets related to Parkinson´s Disease (PD) is presented here, treating each dataset individually or as a four-way meta-analysis. These MCO-supported analyses made it possible to identify MMP9 and TUBB2A as potential PD genetic biomarkers based on their persistent appearance across each of the case studies. A literature search confirmed the importance of these genes in PD and indeed as PD biomarkers, which evidences the code´s potential. |
format | Online Article Text |
id | pubmed-8794188 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-87941882022-01-28 Multiple Criteria Optimization (MCO): A gene selection deterministic tool in RStudio Narváez-Bandera, Isis Suárez-Gómez, Deiver Isaza, Clara E. Cabrera-Ríos, Mauricio PLoS One Research Article Identifying genes with the largest expression changes (gene selection) to characterize a given condition is a popular first step to drive exploration into molecular mechanisms and is, therefore, paramount for therapeutic development. Reproducibility in the sciences makes it necessary to emphasize objectivity and systematic repeatability in biological and informatics analyses, including gene selection. With these two characteristics in mind, in previous works our research team has proposed using multiple criteria optimization (MCO) in gene selection to analyze microarray datasets. The result of this effort is the MCO algorithm, which selects genes with the largest expression changes without user manipulation of neither informatics nor statistical parameters. Furthermore, the user is not required to choose either a preference structure among multiple measures or a predetermined quantity of genes to be deemed significant a priori. This implies that using the same datasets and performance measures (PMs), the method will converge to the same set of selected differentially expressed genes (repeatability) despite who carries out the analysis (objectivity). The present work describes the development of an open-source tool in RStudio to enable both: (1) individual analysis of single datasets with two or three PMs and (2) meta-analysis with up to five microarray datasets, using one PM from each dataset. The capabilities afforded by the code include license-free portability and the possibility to carry out analyses via modest computer hardware, such as personal laptops. The code provides affordable, repeatable, and objective detection of differentially expressed genes from microarrays. It can be used to analyze other experiments with similar experimental comparative layouts, such as microRNA arrays and protein arrays, among others. As a demonstration of the capabilities of the code, the analysis of four publicly-available microarray datasets related to Parkinson´s Disease (PD) is presented here, treating each dataset individually or as a four-way meta-analysis. These MCO-supported analyses made it possible to identify MMP9 and TUBB2A as potential PD genetic biomarkers based on their persistent appearance across each of the case studies. A literature search confirmed the importance of these genes in PD and indeed as PD biomarkers, which evidences the code´s potential. Public Library of Science 2022-01-27 /pmc/articles/PMC8794188/ /pubmed/35085348 http://dx.doi.org/10.1371/journal.pone.0262890 Text en © 2022 Narváez-Bandera et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Narváez-Bandera, Isis Suárez-Gómez, Deiver Isaza, Clara E. Cabrera-Ríos, Mauricio Multiple Criteria Optimization (MCO): A gene selection deterministic tool in RStudio |
title | Multiple Criteria Optimization (MCO): A gene selection deterministic tool in RStudio |
title_full | Multiple Criteria Optimization (MCO): A gene selection deterministic tool in RStudio |
title_fullStr | Multiple Criteria Optimization (MCO): A gene selection deterministic tool in RStudio |
title_full_unstemmed | Multiple Criteria Optimization (MCO): A gene selection deterministic tool in RStudio |
title_short | Multiple Criteria Optimization (MCO): A gene selection deterministic tool in RStudio |
title_sort | multiple criteria optimization (mco): a gene selection deterministic tool in rstudio |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8794188/ https://www.ncbi.nlm.nih.gov/pubmed/35085348 http://dx.doi.org/10.1371/journal.pone.0262890 |
work_keys_str_mv | AT narvaezbanderaisis multiplecriteriaoptimizationmcoageneselectiondeterministictoolinrstudio AT suarezgomezdeiver multiplecriteriaoptimizationmcoageneselectiondeterministictoolinrstudio AT isazaclarae multiplecriteriaoptimizationmcoageneselectiondeterministictoolinrstudio AT cabrerariosmauricio multiplecriteriaoptimizationmcoageneselectiondeterministictoolinrstudio |