Cargando…

Multiple Linear Regression for Reconstruction of Gene Regulatory Networks in Solving Cascade Error Problems

Gene regulatory network (GRN) reconstruction is the process of identifying regulatory gene interactions from experimental data through computational analysis. One of the main reasons for the reduced performance of previous GRN methods had been inaccurate prediction of cascade motifs. Cascade error i...

Descripción completa

Detalles Bibliográficos
Autores principales: Salleh, Faridah Hani Mohamed, Zainudin, Suhaila, Arif, Shereena M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5303608/
https://www.ncbi.nlm.nih.gov/pubmed/28250767
http://dx.doi.org/10.1155/2017/4827171
_version_ 1782506738419761152
author Salleh, Faridah Hani Mohamed
Zainudin, Suhaila
Arif, Shereena M.
author_facet Salleh, Faridah Hani Mohamed
Zainudin, Suhaila
Arif, Shereena M.
author_sort Salleh, Faridah Hani Mohamed
collection PubMed
description Gene regulatory network (GRN) reconstruction is the process of identifying regulatory gene interactions from experimental data through computational analysis. One of the main reasons for the reduced performance of previous GRN methods had been inaccurate prediction of cascade motifs. Cascade error is defined as the wrong prediction of cascade motifs, where an indirect interaction is misinterpreted as a direct interaction. Despite the active research on various GRN prediction methods, the discussion on specific methods to solve problems related to cascade errors is still lacking. In fact, the experiments conducted by the past studies were not specifically geared towards proving the ability of GRN prediction methods in avoiding the occurrences of cascade errors. Hence, this research aims to propose Multiple Linear Regression (MLR) to infer GRN from gene expression data and to avoid wrongly inferring of an indirect interaction (A → B → C) as a direct interaction (A → C). Since the number of observations of the real experiment datasets was far less than the number of predictors, some predictors were eliminated by extracting the random subnetworks from global interaction networks via an established extraction method. In addition, the experiment was extended to assess the effectiveness of MLR in dealing with cascade error by using a novel experimental procedure that had been proposed in this work. The experiment revealed that the number of cascade errors had been very minimal. Apart from that, the Belsley collinearity test proved that multicollinearity did affect the datasets used in this experiment greatly. All the tested subnetworks obtained satisfactory results, with AUROC values above 0.5.
format Online
Article
Text
id pubmed-5303608
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-53036082017-03-01 Multiple Linear Regression for Reconstruction of Gene Regulatory Networks in Solving Cascade Error Problems Salleh, Faridah Hani Mohamed Zainudin, Suhaila Arif, Shereena M. Adv Bioinformatics Research Article Gene regulatory network (GRN) reconstruction is the process of identifying regulatory gene interactions from experimental data through computational analysis. One of the main reasons for the reduced performance of previous GRN methods had been inaccurate prediction of cascade motifs. Cascade error is defined as the wrong prediction of cascade motifs, where an indirect interaction is misinterpreted as a direct interaction. Despite the active research on various GRN prediction methods, the discussion on specific methods to solve problems related to cascade errors is still lacking. In fact, the experiments conducted by the past studies were not specifically geared towards proving the ability of GRN prediction methods in avoiding the occurrences of cascade errors. Hence, this research aims to propose Multiple Linear Regression (MLR) to infer GRN from gene expression data and to avoid wrongly inferring of an indirect interaction (A → B → C) as a direct interaction (A → C). Since the number of observations of the real experiment datasets was far less than the number of predictors, some predictors were eliminated by extracting the random subnetworks from global interaction networks via an established extraction method. In addition, the experiment was extended to assess the effectiveness of MLR in dealing with cascade error by using a novel experimental procedure that had been proposed in this work. The experiment revealed that the number of cascade errors had been very minimal. Apart from that, the Belsley collinearity test proved that multicollinearity did affect the datasets used in this experiment greatly. All the tested subnetworks obtained satisfactory results, with AUROC values above 0.5. Hindawi Publishing Corporation 2017 2017-01-29 /pmc/articles/PMC5303608/ /pubmed/28250767 http://dx.doi.org/10.1155/2017/4827171 Text en Copyright © 2017 Faridah Hani Mohamed Salleh et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Salleh, Faridah Hani Mohamed
Zainudin, Suhaila
Arif, Shereena M.
Multiple Linear Regression for Reconstruction of Gene Regulatory Networks in Solving Cascade Error Problems
title Multiple Linear Regression for Reconstruction of Gene Regulatory Networks in Solving Cascade Error Problems
title_full Multiple Linear Regression for Reconstruction of Gene Regulatory Networks in Solving Cascade Error Problems
title_fullStr Multiple Linear Regression for Reconstruction of Gene Regulatory Networks in Solving Cascade Error Problems
title_full_unstemmed Multiple Linear Regression for Reconstruction of Gene Regulatory Networks in Solving Cascade Error Problems
title_short Multiple Linear Regression for Reconstruction of Gene Regulatory Networks in Solving Cascade Error Problems
title_sort multiple linear regression for reconstruction of gene regulatory networks in solving cascade error problems
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5303608/
https://www.ncbi.nlm.nih.gov/pubmed/28250767
http://dx.doi.org/10.1155/2017/4827171
work_keys_str_mv AT sallehfaridahhanimohamed multiplelinearregressionforreconstructionofgeneregulatorynetworksinsolvingcascadeerrorproblems
AT zainudinsuhaila multiplelinearregressionforreconstructionofgeneregulatorynetworksinsolvingcascadeerrorproblems
AT arifshereenam multiplelinearregressionforreconstructionofgeneregulatorynetworksinsolvingcascadeerrorproblems