Cargando…

Multiple Criteria Optimization (MCO): A gene selection deterministic tool in RStudio

Identifying genes with the largest expression changes (gene selection) to characterize a given condition is a popular first step to drive exploration into molecular mechanisms and is, therefore, paramount for therapeutic development. Reproducibility in the sciences makes it necessary to emphasize ob...

Descripción completa

Detalles Bibliográficos
Autores principales: Narváez-Bandera, Isis, Suárez-Gómez, Deiver, Isaza, Clara E., Cabrera-Ríos, Mauricio
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8794188/
https://www.ncbi.nlm.nih.gov/pubmed/35085348
http://dx.doi.org/10.1371/journal.pone.0262890
_version_ 1784640777239920640
author Narváez-Bandera, Isis
Suárez-Gómez, Deiver
Isaza, Clara E.
Cabrera-Ríos, Mauricio
author_facet Narváez-Bandera, Isis
Suárez-Gómez, Deiver
Isaza, Clara E.
Cabrera-Ríos, Mauricio
author_sort Narváez-Bandera, Isis
collection PubMed
description Identifying genes with the largest expression changes (gene selection) to characterize a given condition is a popular first step to drive exploration into molecular mechanisms and is, therefore, paramount for therapeutic development. Reproducibility in the sciences makes it necessary to emphasize objectivity and systematic repeatability in biological and informatics analyses, including gene selection. With these two characteristics in mind, in previous works our research team has proposed using multiple criteria optimization (MCO) in gene selection to analyze microarray datasets. The result of this effort is the MCO algorithm, which selects genes with the largest expression changes without user manipulation of neither informatics nor statistical parameters. Furthermore, the user is not required to choose either a preference structure among multiple measures or a predetermined quantity of genes to be deemed significant a priori. This implies that using the same datasets and performance measures (PMs), the method will converge to the same set of selected differentially expressed genes (repeatability) despite who carries out the analysis (objectivity). The present work describes the development of an open-source tool in RStudio to enable both: (1) individual analysis of single datasets with two or three PMs and (2) meta-analysis with up to five microarray datasets, using one PM from each dataset. The capabilities afforded by the code include license-free portability and the possibility to carry out analyses via modest computer hardware, such as personal laptops. The code provides affordable, repeatable, and objective detection of differentially expressed genes from microarrays. It can be used to analyze other experiments with similar experimental comparative layouts, such as microRNA arrays and protein arrays, among others. As a demonstration of the capabilities of the code, the analysis of four publicly-available microarray datasets related to Parkinson´s Disease (PD) is presented here, treating each dataset individually or as a four-way meta-analysis. These MCO-supported analyses made it possible to identify MMP9 and TUBB2A as potential PD genetic biomarkers based on their persistent appearance across each of the case studies. A literature search confirmed the importance of these genes in PD and indeed as PD biomarkers, which evidences the code´s potential.
format Online
Article
Text
id pubmed-8794188
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-87941882022-01-28 Multiple Criteria Optimization (MCO): A gene selection deterministic tool in RStudio Narváez-Bandera, Isis Suárez-Gómez, Deiver Isaza, Clara E. Cabrera-Ríos, Mauricio PLoS One Research Article Identifying genes with the largest expression changes (gene selection) to characterize a given condition is a popular first step to drive exploration into molecular mechanisms and is, therefore, paramount for therapeutic development. Reproducibility in the sciences makes it necessary to emphasize objectivity and systematic repeatability in biological and informatics analyses, including gene selection. With these two characteristics in mind, in previous works our research team has proposed using multiple criteria optimization (MCO) in gene selection to analyze microarray datasets. The result of this effort is the MCO algorithm, which selects genes with the largest expression changes without user manipulation of neither informatics nor statistical parameters. Furthermore, the user is not required to choose either a preference structure among multiple measures or a predetermined quantity of genes to be deemed significant a priori. This implies that using the same datasets and performance measures (PMs), the method will converge to the same set of selected differentially expressed genes (repeatability) despite who carries out the analysis (objectivity). The present work describes the development of an open-source tool in RStudio to enable both: (1) individual analysis of single datasets with two or three PMs and (2) meta-analysis with up to five microarray datasets, using one PM from each dataset. The capabilities afforded by the code include license-free portability and the possibility to carry out analyses via modest computer hardware, such as personal laptops. The code provides affordable, repeatable, and objective detection of differentially expressed genes from microarrays. It can be used to analyze other experiments with similar experimental comparative layouts, such as microRNA arrays and protein arrays, among others. As a demonstration of the capabilities of the code, the analysis of four publicly-available microarray datasets related to Parkinson´s Disease (PD) is presented here, treating each dataset individually or as a four-way meta-analysis. These MCO-supported analyses made it possible to identify MMP9 and TUBB2A as potential PD genetic biomarkers based on their persistent appearance across each of the case studies. A literature search confirmed the importance of these genes in PD and indeed as PD biomarkers, which evidences the code´s potential. Public Library of Science 2022-01-27 /pmc/articles/PMC8794188/ /pubmed/35085348 http://dx.doi.org/10.1371/journal.pone.0262890 Text en © 2022 Narváez-Bandera et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Narváez-Bandera, Isis
Suárez-Gómez, Deiver
Isaza, Clara E.
Cabrera-Ríos, Mauricio
Multiple Criteria Optimization (MCO): A gene selection deterministic tool in RStudio
title Multiple Criteria Optimization (MCO): A gene selection deterministic tool in RStudio
title_full Multiple Criteria Optimization (MCO): A gene selection deterministic tool in RStudio
title_fullStr Multiple Criteria Optimization (MCO): A gene selection deterministic tool in RStudio
title_full_unstemmed Multiple Criteria Optimization (MCO): A gene selection deterministic tool in RStudio
title_short Multiple Criteria Optimization (MCO): A gene selection deterministic tool in RStudio
title_sort multiple criteria optimization (mco): a gene selection deterministic tool in rstudio
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8794188/
https://www.ncbi.nlm.nih.gov/pubmed/35085348
http://dx.doi.org/10.1371/journal.pone.0262890
work_keys_str_mv AT narvaezbanderaisis multiplecriteriaoptimizationmcoageneselectiondeterministictoolinrstudio
AT suarezgomezdeiver multiplecriteriaoptimizationmcoageneselectiondeterministictoolinrstudio
AT isazaclarae multiplecriteriaoptimizationmcoageneselectiondeterministictoolinrstudio
AT cabrerariosmauricio multiplecriteriaoptimizationmcoageneselectiondeterministictoolinrstudio