Cargando…

Robust pathway sampling in phenotype prediction. Application to triple negative breast cancer

BACKGROUND: Phenotype prediction problems are usually considered ill-posed, as the amount of samples is very limited with respect to the scrutinized genetic probes. This fact complicates the sampling of the defective genetic pathways due to the high number of possible discriminatory genetic networks...

Descripción completa

Detalles Bibliográficos
Autores principales: Cernea, Ana, Fernández-Martínez, Juan Luis, deAndrés-Galiana, Enrique J., Fernández-Ovies, Francisco Javier, Alvarez-Machancoses, Oscar, Fernández-Muñiz, Zulima, Saligan, Leorey N., Sonis, Stephen T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7068866/
https://www.ncbi.nlm.nih.gov/pubmed/32164540
http://dx.doi.org/10.1186/s12859-020-3356-6
_version_ 1783505659288879104
author Cernea, Ana
Fernández-Martínez, Juan Luis
deAndrés-Galiana, Enrique J.
Fernández-Ovies, Francisco Javier
Alvarez-Machancoses, Oscar
Fernández-Muñiz, Zulima
Saligan, Leorey N.
Sonis, Stephen T.
author_facet Cernea, Ana
Fernández-Martínez, Juan Luis
deAndrés-Galiana, Enrique J.
Fernández-Ovies, Francisco Javier
Alvarez-Machancoses, Oscar
Fernández-Muñiz, Zulima
Saligan, Leorey N.
Sonis, Stephen T.
author_sort Cernea, Ana
collection PubMed
description BACKGROUND: Phenotype prediction problems are usually considered ill-posed, as the amount of samples is very limited with respect to the scrutinized genetic probes. This fact complicates the sampling of the defective genetic pathways due to the high number of possible discriminatory genetic networks involved. In this research, we outline three novel sampling algorithms utilized to identify, classify and characterize the defective pathways in phenotype prediction problems, such as the Fisher’s ratio sampler, the Holdout sampler and the Random sampler, and apply each one to the analysis of genetic pathways involved in tumor behavior and outcomes of triple negative breast cancers (TNBC). Altered biological pathways are identified using the most frequently sampled genes and are compared to those obtained via Bayesian Networks (BNs). RESULTS: Random, Fisher’s ratio and Holdout samplers were more accurate and robust than BNs, while providing comparable insights about disease genomics. CONCLUSIONS: The three samplers tested are good alternatives to Bayesian Networks since they are less computationally demanding algorithms. Importantly, this analysis confirms the concept of “biological invariance” since the altered pathways should be independent of the sampling methodology and the classifier used for their inference. Nevertheless, still some modifications are needed in the Bayesian networks to be able to sample correctly the uncertainty space in phenotype prediction problems, since the probabilistic parameterization of the uncertainty space is not unique and the use of the optimum network might falsify the pathways analysis.
format Online
Article
Text
id pubmed-7068866
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-70688662020-03-18 Robust pathway sampling in phenotype prediction. Application to triple negative breast cancer Cernea, Ana Fernández-Martínez, Juan Luis deAndrés-Galiana, Enrique J. Fernández-Ovies, Francisco Javier Alvarez-Machancoses, Oscar Fernández-Muñiz, Zulima Saligan, Leorey N. Sonis, Stephen T. BMC Bioinformatics Research BACKGROUND: Phenotype prediction problems are usually considered ill-posed, as the amount of samples is very limited with respect to the scrutinized genetic probes. This fact complicates the sampling of the defective genetic pathways due to the high number of possible discriminatory genetic networks involved. In this research, we outline three novel sampling algorithms utilized to identify, classify and characterize the defective pathways in phenotype prediction problems, such as the Fisher’s ratio sampler, the Holdout sampler and the Random sampler, and apply each one to the analysis of genetic pathways involved in tumor behavior and outcomes of triple negative breast cancers (TNBC). Altered biological pathways are identified using the most frequently sampled genes and are compared to those obtained via Bayesian Networks (BNs). RESULTS: Random, Fisher’s ratio and Holdout samplers were more accurate and robust than BNs, while providing comparable insights about disease genomics. CONCLUSIONS: The three samplers tested are good alternatives to Bayesian Networks since they are less computationally demanding algorithms. Importantly, this analysis confirms the concept of “biological invariance” since the altered pathways should be independent of the sampling methodology and the classifier used for their inference. Nevertheless, still some modifications are needed in the Bayesian networks to be able to sample correctly the uncertainty space in phenotype prediction problems, since the probabilistic parameterization of the uncertainty space is not unique and the use of the optimum network might falsify the pathways analysis. BioMed Central 2020-03-11 /pmc/articles/PMC7068866/ /pubmed/32164540 http://dx.doi.org/10.1186/s12859-020-3356-6 Text en © The Author(s). 2020 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Cernea, Ana
Fernández-Martínez, Juan Luis
deAndrés-Galiana, Enrique J.
Fernández-Ovies, Francisco Javier
Alvarez-Machancoses, Oscar
Fernández-Muñiz, Zulima
Saligan, Leorey N.
Sonis, Stephen T.
Robust pathway sampling in phenotype prediction. Application to triple negative breast cancer
title Robust pathway sampling in phenotype prediction. Application to triple negative breast cancer
title_full Robust pathway sampling in phenotype prediction. Application to triple negative breast cancer
title_fullStr Robust pathway sampling in phenotype prediction. Application to triple negative breast cancer
title_full_unstemmed Robust pathway sampling in phenotype prediction. Application to triple negative breast cancer
title_short Robust pathway sampling in phenotype prediction. Application to triple negative breast cancer
title_sort robust pathway sampling in phenotype prediction. application to triple negative breast cancer
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7068866/
https://www.ncbi.nlm.nih.gov/pubmed/32164540
http://dx.doi.org/10.1186/s12859-020-3356-6
work_keys_str_mv AT cerneaana robustpathwaysamplinginphenotypepredictionapplicationtotriplenegativebreastcancer
AT fernandezmartinezjuanluis robustpathwaysamplinginphenotypepredictionapplicationtotriplenegativebreastcancer
AT deandresgalianaenriquej robustpathwaysamplinginphenotypepredictionapplicationtotriplenegativebreastcancer
AT fernandezoviesfranciscojavier robustpathwaysamplinginphenotypepredictionapplicationtotriplenegativebreastcancer
AT alvarezmachancosesoscar robustpathwaysamplinginphenotypepredictionapplicationtotriplenegativebreastcancer
AT fernandezmunizzulima robustpathwaysamplinginphenotypepredictionapplicationtotriplenegativebreastcancer
AT saliganleoreyn robustpathwaysamplinginphenotypepredictionapplicationtotriplenegativebreastcancer
AT sonisstephent robustpathwaysamplinginphenotypepredictionapplicationtotriplenegativebreastcancer