Cargando…

NetMiner-an ensemble pipeline for building genome-wide and high-quality gene co-expression network using massive-scale RNA-seq samples

Accurately reconstructing gene co-expression network is of great importance for uncovering the genetic architecture underlying complex and various phenotypes. The recent availability of high-throughput RNA-seq sequencing has made genome-wide detecting and quantifying of the novel, rare and low-abund...

Descripción completa

Detalles Bibliográficos
Autores principales: Yu, Hua, Jiao, Bingke, Lu, Lu, Wang, Pengfei, Chen, Shuangcheng, Liang, Chengzhi, Liu, Wei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5806890/
https://www.ncbi.nlm.nih.gov/pubmed/29425247
http://dx.doi.org/10.1371/journal.pone.0192613
_version_ 1783299189639217152
author Yu, Hua
Jiao, Bingke
Lu, Lu
Wang, Pengfei
Chen, Shuangcheng
Liang, Chengzhi
Liu, Wei
author_facet Yu, Hua
Jiao, Bingke
Lu, Lu
Wang, Pengfei
Chen, Shuangcheng
Liang, Chengzhi
Liu, Wei
author_sort Yu, Hua
collection PubMed
description Accurately reconstructing gene co-expression network is of great importance for uncovering the genetic architecture underlying complex and various phenotypes. The recent availability of high-throughput RNA-seq sequencing has made genome-wide detecting and quantifying of the novel, rare and low-abundance transcripts practical. However, its potential merits in reconstructing gene co-expression network have still not been well explored. Using massive-scale RNA-seq samples, we have designed an ensemble pipeline, called NetMiner, for building genome-scale and high-quality Gene Co-expression Network (GCN) by integrating three frequently used inference algorithms. We constructed a RNA-seq-based GCN in one species of monocot rice. The quality of network obtained by our method was verified and evaluated by the curated gene functional association data sets, which obviously outperformed each single method. In addition, the powerful capability of network for associating genes with functions and agronomic traits was shown by enrichment analysis and case studies. In particular, we demonstrated the potential value of our proposed method to predict the biological roles of unknown protein-coding genes, long non-coding RNA (lncRNA) genes and circular RNA (circRNA) genes. Our results provided a valuable and highly reliable data source to select key candidate genes for subsequent experimental validation. To facilitate identification of novel genes regulating important biological processes and phenotypes in other plants or animals, we have published the source code of NetMiner, making it freely available at https://github.com/czllab/NetMiner.
format Online
Article
Text
id pubmed-5806890
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-58068902018-02-23 NetMiner-an ensemble pipeline for building genome-wide and high-quality gene co-expression network using massive-scale RNA-seq samples Yu, Hua Jiao, Bingke Lu, Lu Wang, Pengfei Chen, Shuangcheng Liang, Chengzhi Liu, Wei PLoS One Research Article Accurately reconstructing gene co-expression network is of great importance for uncovering the genetic architecture underlying complex and various phenotypes. The recent availability of high-throughput RNA-seq sequencing has made genome-wide detecting and quantifying of the novel, rare and low-abundance transcripts practical. However, its potential merits in reconstructing gene co-expression network have still not been well explored. Using massive-scale RNA-seq samples, we have designed an ensemble pipeline, called NetMiner, for building genome-scale and high-quality Gene Co-expression Network (GCN) by integrating three frequently used inference algorithms. We constructed a RNA-seq-based GCN in one species of monocot rice. The quality of network obtained by our method was verified and evaluated by the curated gene functional association data sets, which obviously outperformed each single method. In addition, the powerful capability of network for associating genes with functions and agronomic traits was shown by enrichment analysis and case studies. In particular, we demonstrated the potential value of our proposed method to predict the biological roles of unknown protein-coding genes, long non-coding RNA (lncRNA) genes and circular RNA (circRNA) genes. Our results provided a valuable and highly reliable data source to select key candidate genes for subsequent experimental validation. To facilitate identification of novel genes regulating important biological processes and phenotypes in other plants or animals, we have published the source code of NetMiner, making it freely available at https://github.com/czllab/NetMiner. Public Library of Science 2018-02-09 /pmc/articles/PMC5806890/ /pubmed/29425247 http://dx.doi.org/10.1371/journal.pone.0192613 Text en © 2018 Yu et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Yu, Hua
Jiao, Bingke
Lu, Lu
Wang, Pengfei
Chen, Shuangcheng
Liang, Chengzhi
Liu, Wei
NetMiner-an ensemble pipeline for building genome-wide and high-quality gene co-expression network using massive-scale RNA-seq samples
title NetMiner-an ensemble pipeline for building genome-wide and high-quality gene co-expression network using massive-scale RNA-seq samples
title_full NetMiner-an ensemble pipeline for building genome-wide and high-quality gene co-expression network using massive-scale RNA-seq samples
title_fullStr NetMiner-an ensemble pipeline for building genome-wide and high-quality gene co-expression network using massive-scale RNA-seq samples
title_full_unstemmed NetMiner-an ensemble pipeline for building genome-wide and high-quality gene co-expression network using massive-scale RNA-seq samples
title_short NetMiner-an ensemble pipeline for building genome-wide and high-quality gene co-expression network using massive-scale RNA-seq samples
title_sort netminer-an ensemble pipeline for building genome-wide and high-quality gene co-expression network using massive-scale rna-seq samples
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5806890/
https://www.ncbi.nlm.nih.gov/pubmed/29425247
http://dx.doi.org/10.1371/journal.pone.0192613
work_keys_str_mv AT yuhua netmineranensemblepipelineforbuildinggenomewideandhighqualitygenecoexpressionnetworkusingmassivescalernaseqsamples
AT jiaobingke netmineranensemblepipelineforbuildinggenomewideandhighqualitygenecoexpressionnetworkusingmassivescalernaseqsamples
AT lulu netmineranensemblepipelineforbuildinggenomewideandhighqualitygenecoexpressionnetworkusingmassivescalernaseqsamples
AT wangpengfei netmineranensemblepipelineforbuildinggenomewideandhighqualitygenecoexpressionnetworkusingmassivescalernaseqsamples
AT chenshuangcheng netmineranensemblepipelineforbuildinggenomewideandhighqualitygenecoexpressionnetworkusingmassivescalernaseqsamples
AT liangchengzhi netmineranensemblepipelineforbuildinggenomewideandhighqualitygenecoexpressionnetworkusingmassivescalernaseqsamples
AT liuwei netmineranensemblepipelineforbuildinggenomewideandhighqualitygenecoexpressionnetworkusingmassivescalernaseqsamples