Cargando…

Homoeologous gene expression and co-expression network analyses and evolutionary inference in allopolyploids

Polyploidy is a widespread phenomenon throughout eukaryotes. Due to the coexistence of duplicated genomes, polyploids offer unique challenges for estimating gene expression levels, which is essential for understanding the massive and various forms of transcriptomic responses accompanying polyploidy....

Descripción completa

Detalles Bibliográficos
Autores principales: Hu, Guanjing, Grover, Corrinne E, Arick, Mark A, Liu, Meiling, Peterson, Daniel G, Wendel, Jonathan F
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7986634/
https://www.ncbi.nlm.nih.gov/pubmed/32219306
http://dx.doi.org/10.1093/bib/bbaa035
_version_ 1783668480451543040
author Hu, Guanjing
Grover, Corrinne E
Arick, Mark A
Liu, Meiling
Peterson, Daniel G
Wendel, Jonathan F
author_facet Hu, Guanjing
Grover, Corrinne E
Arick, Mark A
Liu, Meiling
Peterson, Daniel G
Wendel, Jonathan F
author_sort Hu, Guanjing
collection PubMed
description Polyploidy is a widespread phenomenon throughout eukaryotes. Due to the coexistence of duplicated genomes, polyploids offer unique challenges for estimating gene expression levels, which is essential for understanding the massive and various forms of transcriptomic responses accompanying polyploidy. Although previous studies have explored the bioinformatics of polyploid transcriptomic profiling, the causes and consequences of inaccurate quantification of transcripts from duplicated gene copies have not been addressed. Using transcriptomic data from the cotton genus (Gossypium) as an example, we present an analytical workflow to evaluate a variety of bioinformatic method choices at different stages of RNA-seq analysis, from homoeolog expression quantification to downstream analysis used to infer key phenomena of polyploid expression evolution. In general, EAGLE-RC and GSNAP-PolyCat outperform other quantification pipelines tested, and their derived expression dataset best represents the expected homoeolog expression and co-expression divergence. The performance of co-expression network analysis was less affected by homoeolog quantification than by network construction methods, where weighted networks outperformed binary networks. By examining the extent and consequences of homoeolog read ambiguity, we illuminate the potential artifacts that may affect our understanding of duplicate gene expression, including an overestimation of homoeolog co-regulation and the incorrect inference of subgenome asymmetry in network topology. Taken together, our work points to a set of reasonable practices that we hope are broadly applicable to the evolutionary exploration of polyploids.
format Online
Article
Text
id pubmed-7986634
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-79866342021-03-26 Homoeologous gene expression and co-expression network analyses and evolutionary inference in allopolyploids Hu, Guanjing Grover, Corrinne E Arick, Mark A Liu, Meiling Peterson, Daniel G Wendel, Jonathan F Brief Bioinform Method Review Polyploidy is a widespread phenomenon throughout eukaryotes. Due to the coexistence of duplicated genomes, polyploids offer unique challenges for estimating gene expression levels, which is essential for understanding the massive and various forms of transcriptomic responses accompanying polyploidy. Although previous studies have explored the bioinformatics of polyploid transcriptomic profiling, the causes and consequences of inaccurate quantification of transcripts from duplicated gene copies have not been addressed. Using transcriptomic data from the cotton genus (Gossypium) as an example, we present an analytical workflow to evaluate a variety of bioinformatic method choices at different stages of RNA-seq analysis, from homoeolog expression quantification to downstream analysis used to infer key phenomena of polyploid expression evolution. In general, EAGLE-RC and GSNAP-PolyCat outperform other quantification pipelines tested, and their derived expression dataset best represents the expected homoeolog expression and co-expression divergence. The performance of co-expression network analysis was less affected by homoeolog quantification than by network construction methods, where weighted networks outperformed binary networks. By examining the extent and consequences of homoeolog read ambiguity, we illuminate the potential artifacts that may affect our understanding of duplicate gene expression, including an overestimation of homoeolog co-regulation and the incorrect inference of subgenome asymmetry in network topology. Taken together, our work points to a set of reasonable practices that we hope are broadly applicable to the evolutionary exploration of polyploids. Oxford University Press 2020-03-27 /pmc/articles/PMC7986634/ /pubmed/32219306 http://dx.doi.org/10.1093/bib/bbaa035 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Method Review
Hu, Guanjing
Grover, Corrinne E
Arick, Mark A
Liu, Meiling
Peterson, Daniel G
Wendel, Jonathan F
Homoeologous gene expression and co-expression network analyses and evolutionary inference in allopolyploids
title Homoeologous gene expression and co-expression network analyses and evolutionary inference in allopolyploids
title_full Homoeologous gene expression and co-expression network analyses and evolutionary inference in allopolyploids
title_fullStr Homoeologous gene expression and co-expression network analyses and evolutionary inference in allopolyploids
title_full_unstemmed Homoeologous gene expression and co-expression network analyses and evolutionary inference in allopolyploids
title_short Homoeologous gene expression and co-expression network analyses and evolutionary inference in allopolyploids
title_sort homoeologous gene expression and co-expression network analyses and evolutionary inference in allopolyploids
topic Method Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7986634/
https://www.ncbi.nlm.nih.gov/pubmed/32219306
http://dx.doi.org/10.1093/bib/bbaa035
work_keys_str_mv AT huguanjing homoeologousgeneexpressionandcoexpressionnetworkanalysesandevolutionaryinferenceinallopolyploids
AT grovercorrinnee homoeologousgeneexpressionandcoexpressionnetworkanalysesandevolutionaryinferenceinallopolyploids
AT arickmarka homoeologousgeneexpressionandcoexpressionnetworkanalysesandevolutionaryinferenceinallopolyploids
AT liumeiling homoeologousgeneexpressionandcoexpressionnetworkanalysesandevolutionaryinferenceinallopolyploids
AT petersondanielg homoeologousgeneexpressionandcoexpressionnetworkanalysesandevolutionaryinferenceinallopolyploids
AT wendeljonathanf homoeologousgeneexpressionandcoexpressionnetworkanalysesandevolutionaryinferenceinallopolyploids