Cargando…

Building gene co-expression networks using transcriptomics data for systems biology investigations: Comparison of methods using microarray data

Gene co-expression networks (GCN), built using high-throughput gene expression data are fundamental aspects of systems biology. The main aims of this study were to compare two popular approaches to building and analysing GCN. We use real ovine microarray transcriptomics datasets representing four di...

Descripción completa

Detalles Bibliográficos
Autores principales: Kadarmideen, Haja N, Watson-haigh, Nathan S
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Biomedical Informatics 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3489090/
https://www.ncbi.nlm.nih.gov/pubmed/23144540
http://dx.doi.org/10.6026/97320630008855
_version_ 1782248716163350528
author Kadarmideen, Haja N
Watson-haigh, Nathan S
author_facet Kadarmideen, Haja N
Watson-haigh, Nathan S
author_sort Kadarmideen, Haja N
collection PubMed
description Gene co-expression networks (GCN), built using high-throughput gene expression data are fundamental aspects of systems biology. The main aims of this study were to compare two popular approaches to building and analysing GCN. We use real ovine microarray transcriptomics datasets representing four different treatments with Metyrapone, an inhibitor of cortisol biosynthesis. We conducted several microarray quality control checks before applying GCN methods to filtered datasets. Then we compared the outputs of two methods using connectivity as a criterion, as it measures how well a node (gene) is connected within a network. The two GCN construction methods used were, Weighted Gene Co-expression Network Analysis (WGCNA) and Partial Correlation and Information Theory (PCIT) methods. Nodes were ranked based on their connectivity measures in each of the four different networks created by WGCNA and PCIT and node ranks in two methods were compared to identify those nodes which are highly differentially ranked (HDR). A total of 1,017 HDR nodes were identified across one or more of four networks. We investigated HDR nodes by gene enrichment analyses in relation to their biological relevance to phenotypes. We observed that, in contrast to WGCNA method, PCIT algorithm removes many of the edges of the most highly interconnected nodes. Removal of edges of most highly connected nodes or hub genes will have consequences for downstream analyses and biological interpretations. In general, for large GCN construction (with > 20000 genes) access to large computer clusters, particularly those with larger amounts of shared memory is recommended.
format Online
Article
Text
id pubmed-3489090
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Biomedical Informatics
record_format MEDLINE/PubMed
spelling pubmed-34890902012-11-09 Building gene co-expression networks using transcriptomics data for systems biology investigations: Comparison of methods using microarray data Kadarmideen, Haja N Watson-haigh, Nathan S Bioinformation Hypothesis Gene co-expression networks (GCN), built using high-throughput gene expression data are fundamental aspects of systems biology. The main aims of this study were to compare two popular approaches to building and analysing GCN. We use real ovine microarray transcriptomics datasets representing four different treatments with Metyrapone, an inhibitor of cortisol biosynthesis. We conducted several microarray quality control checks before applying GCN methods to filtered datasets. Then we compared the outputs of two methods using connectivity as a criterion, as it measures how well a node (gene) is connected within a network. The two GCN construction methods used were, Weighted Gene Co-expression Network Analysis (WGCNA) and Partial Correlation and Information Theory (PCIT) methods. Nodes were ranked based on their connectivity measures in each of the four different networks created by WGCNA and PCIT and node ranks in two methods were compared to identify those nodes which are highly differentially ranked (HDR). A total of 1,017 HDR nodes were identified across one or more of four networks. We investigated HDR nodes by gene enrichment analyses in relation to their biological relevance to phenotypes. We observed that, in contrast to WGCNA method, PCIT algorithm removes many of the edges of the most highly interconnected nodes. Removal of edges of most highly connected nodes or hub genes will have consequences for downstream analyses and biological interpretations. In general, for large GCN construction (with > 20000 genes) access to large computer clusters, particularly those with larger amounts of shared memory is recommended. Biomedical Informatics 2012-09-21 /pmc/articles/PMC3489090/ /pubmed/23144540 http://dx.doi.org/10.6026/97320630008855 Text en © 2012 Biomedical Informatics This is an open-access article, which permits unrestricted use, distribution, and reproduction in any medium, for non-commercial purposes, provided the original author and source are credited.
spellingShingle Hypothesis
Kadarmideen, Haja N
Watson-haigh, Nathan S
Building gene co-expression networks using transcriptomics data for systems biology investigations: Comparison of methods using microarray data
title Building gene co-expression networks using transcriptomics data for systems biology investigations: Comparison of methods using microarray data
title_full Building gene co-expression networks using transcriptomics data for systems biology investigations: Comparison of methods using microarray data
title_fullStr Building gene co-expression networks using transcriptomics data for systems biology investigations: Comparison of methods using microarray data
title_full_unstemmed Building gene co-expression networks using transcriptomics data for systems biology investigations: Comparison of methods using microarray data
title_short Building gene co-expression networks using transcriptomics data for systems biology investigations: Comparison of methods using microarray data
title_sort building gene co-expression networks using transcriptomics data for systems biology investigations: comparison of methods using microarray data
topic Hypothesis
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3489090/
https://www.ncbi.nlm.nih.gov/pubmed/23144540
http://dx.doi.org/10.6026/97320630008855
work_keys_str_mv AT kadarmideenhajan buildinggenecoexpressionnetworksusingtranscriptomicsdataforsystemsbiologyinvestigationscomparisonofmethodsusingmicroarraydata
AT watsonhaighnathans buildinggenecoexpressionnetworksusingtranscriptomicsdataforsystemsbiologyinvestigationscomparisonofmethodsusingmicroarraydata