Building gene co-expression networks using transcriptomics data for systems biology investigations: Comparison of methods using microarray data
Gene co-expression networks (GCN), built using high-throughput gene expression data, are fundamental aspects of systems biology. The main aims of this study were to compare two popular approaches to building and analysing GCN. We use real ovine microarray transcriptomics datasets representing four di...
Main Authors: | Kadarmideen, Haja N; Watson-haigh, Nathan S |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Biomedical Informatics 2012 |
Subjects: | |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3489090/ https://www.ncbi.nlm.nih.gov/pubmed/23144540 http://dx.doi.org/10.6026/97320630008855 |
_version_ | 1782248716163350528 |
---|---|
author | Kadarmideen, Haja N; Watson-haigh, Nathan S |
collection | PubMed |
description | Gene co-expression networks (GCN), built using high-throughput gene expression data, are fundamental aspects of systems biology. The main aim of this study was to compare two popular approaches to building and analysing GCN. We used real ovine microarray transcriptomics datasets representing four different treatments with Metyrapone, an inhibitor of cortisol biosynthesis. We conducted several microarray quality control checks before applying the GCN methods to the filtered datasets. We then compared the outputs of the two methods using connectivity as the criterion, as it measures how well a node (gene) is connected within a network. The two GCN construction methods used were Weighted Gene Co-expression Network Analysis (WGCNA) and Partial Correlation and Information Theory (PCIT). Nodes were ranked by their connectivity measures in each of the four networks created by WGCNA and by PCIT, and the node ranks from the two methods were compared to identify nodes that are highly differentially ranked (HDR). A total of 1,017 HDR nodes were identified across one or more of the four networks. We investigated the HDR nodes by gene enrichment analyses in relation to their biological relevance to phenotypes. We observed that, in contrast to the WGCNA method, the PCIT algorithm removes many of the edges of the most highly interconnected nodes. Removing edges from the most highly connected nodes, or hub genes, will have consequences for downstream analyses and biological interpretations. In general, for large GCN construction (with > 20,000 genes), access to large computer clusters, particularly those with larger amounts of shared memory, is recommended. |
format | Online Article Text |
id | pubmed-3489090 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Biomedical Informatics |
record_format | MEDLINE/PubMed |
spelling | pubmed-3489090 2012-11-09 Building gene co-expression networks using transcriptomics data for systems biology investigations: Comparison of methods using microarray data Kadarmideen, Haja N Watson-haigh, Nathan S Bioinformation Hypothesis Biomedical Informatics 2012-09-21 /pmc/articles/PMC3489090/ /pubmed/23144540 http://dx.doi.org/10.6026/97320630008855 Text en © 2012 Biomedical Informatics This is an open-access article, which permits unrestricted use, distribution, and reproduction in any medium, for non-commercial purposes, provided the original author and source are credited. |
title | Building gene co-expression networks using transcriptomics data for systems biology investigations: Comparison of methods using microarray data |
topic | Hypothesis |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3489090/ https://www.ncbi.nlm.nih.gov/pubmed/23144540 http://dx.doi.org/10.6026/97320630008855 |
work_keys_str_mv | AT kadarmideenhajan buildinggenecoexpressionnetworksusingtranscriptomicsdataforsystemsbiologyinvestigationscomparisonofmethodsusingmicroarraydata AT watsonhaighnathans buildinggenecoexpressionnetworksusingtranscriptomicsdataforsystemsbiologyinvestigationscomparisonofmethodsusingmicroarraydata |
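
The rank comparison described in the abstract (rank nodes by connectivity in each network, then compare the ranks between methods) can be illustrated with a minimal sketch. This is not the authors' code and does not call WGCNA or PCIT. The toy networks, the function names, and the `min_rank_gap` cut-off are all hypothetical, since the record does not state the criterion used to call a node highly differentially ranked. Connectivity is taken here as the sum of a node's edge weights, which is one common definition.

```python
def connectivity(adjacency):
    """Connectivity of each node: sum of its edge weights, ignoring self-edges."""
    return {
        node: sum(w for other, w in neighbours.items() if other != node)
        for node, neighbours in adjacency.items()
    }


def rank_nodes(conn):
    """Rank 1 = most connected; ties are broken by node name for determinism."""
    ordered = sorted(conn, key=lambda n: (-conn[n], n))
    return {node: i + 1 for i, node in enumerate(ordered)}


def differentially_ranked(conn_a, conn_b, min_rank_gap):
    """Nodes whose ranks in the two networks differ by at least min_rank_gap.

    min_rank_gap is a hypothetical threshold chosen for illustration only.
    """
    shared = conn_a.keys() & conn_b.keys()
    ranks_a = rank_nodes({n: conn_a[n] for n in shared})
    ranks_b = rank_nodes({n: conn_b[n] for n in shared})
    return sorted(n for n in shared if abs(ranks_a[n] - ranks_b[n]) >= min_rank_gap)


# Toy weighted network (hypothetical values): g1 is the hub.
weighted = {
    "g1": {"g2": 0.9, "g3": 0.8, "g4": 0.7},
    "g2": {"g1": 0.9, "g3": 0.2, "g4": 0.1},
    "g3": {"g1": 0.8, "g2": 0.2, "g4": 0.1},
    "g4": {"g1": 0.7, "g2": 0.1, "g3": 0.1},
}

# Toy pruned network (hypothetical): most of the hub's edges have been removed.
pruned = {
    "g1": {"g2": 1},
    "g2": {"g1": 1, "g3": 1, "g4": 1},
    "g3": {"g2": 1, "g4": 1},
    "g4": {"g2": 1, "g3": 1},
}

hdr = differentially_ranked(connectivity(weighted), connectivity(pruned), min_rank_gap=2)
print(hdr)  # ['g1']
```

In this toy case the hub `g1` drops from rank 1 to rank 4 once its edges are pruned, so it is the only node flagged. This mirrors, on a very small scale, the abstract's observation that removing edges from hub genes changes their connectivity ranking.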