Cargando…
Homoeologous gene expression and co-expression network analyses and evolutionary inference in allopolyploids
Polyploidy is a widespread phenomenon throughout eukaryotes. Due to the coexistence of duplicated genomes, polyploids offer unique challenges for estimating gene expression levels, which is essential for understanding the massive and various forms of transcriptomic responses accompanying polyploidy....
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7986634/ https://www.ncbi.nlm.nih.gov/pubmed/32219306 http://dx.doi.org/10.1093/bib/bbaa035 |
_version_ | 1783668480451543040 |
---|---|
author | Hu, Guanjing Grover, Corrinne E Arick, Mark A Liu, Meiling Peterson, Daniel G Wendel, Jonathan F |
author_facet | Hu, Guanjing Grover, Corrinne E Arick, Mark A Liu, Meiling Peterson, Daniel G Wendel, Jonathan F |
author_sort | Hu, Guanjing |
collection | PubMed |
description | Polyploidy is a widespread phenomenon throughout eukaryotes. Due to the coexistence of duplicated genomes, polyploids offer unique challenges for estimating gene expression levels, which is essential for understanding the massive and various forms of transcriptomic responses accompanying polyploidy. Although previous studies have explored the bioinformatics of polyploid transcriptomic profiling, the causes and consequences of inaccurate quantification of transcripts from duplicated gene copies have not been addressed. Using transcriptomic data from the cotton genus (Gossypium) as an example, we present an analytical workflow to evaluate a variety of bioinformatic method choices at different stages of RNA-seq analysis, from homoeolog expression quantification to downstream analysis used to infer key phenomena of polyploid expression evolution. In general, EAGLE-RC and GSNAP-PolyCat outperform other quantification pipelines tested, and their derived expression dataset best represents the expected homoeolog expression and co-expression divergence. The performance of co-expression network analysis was less affected by homoeolog quantification than by network construction methods, where weighted networks outperformed binary networks. By examining the extent and consequences of homoeolog read ambiguity, we illuminate the potential artifacts that may affect our understanding of duplicate gene expression, including an overestimation of homoeolog co-regulation and the incorrect inference of subgenome asymmetry in network topology. Taken together, our work points to a set of reasonable practices that we hope are broadly applicable to the evolutionary exploration of polyploids. |
format | Online Article Text |
id | pubmed-7986634 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-79866342021-03-26 Homoeologous gene expression and co-expression network analyses and evolutionary inference in allopolyploids Hu, Guanjing Grover, Corrinne E Arick, Mark A Liu, Meiling Peterson, Daniel G Wendel, Jonathan F Brief Bioinform Method Review Polyploidy is a widespread phenomenon throughout eukaryotes. Due to the coexistence of duplicated genomes, polyploids offer unique challenges for estimating gene expression levels, which is essential for understanding the massive and various forms of transcriptomic responses accompanying polyploidy. Although previous studies have explored the bioinformatics of polyploid transcriptomic profiling, the causes and consequences of inaccurate quantification of transcripts from duplicated gene copies have not been addressed. Using transcriptomic data from the cotton genus (Gossypium) as an example, we present an analytical workflow to evaluate a variety of bioinformatic method choices at different stages of RNA-seq analysis, from homoeolog expression quantification to downstream analysis used to infer key phenomena of polyploid expression evolution. In general, EAGLE-RC and GSNAP-PolyCat outperform other quantification pipelines tested, and their derived expression dataset best represents the expected homoeolog expression and co-expression divergence. The performance of co-expression network analysis was less affected by homoeolog quantification than by network construction methods, where weighted networks outperformed binary networks. By examining the extent and consequences of homoeolog read ambiguity, we illuminate the potential artifacts that may affect our understanding of duplicate gene expression, including an overestimation of homoeolog co-regulation and the incorrect inference of subgenome asymmetry in network topology. Taken together, our work points to a set of reasonable practices that we hope are broadly applicable to the evolutionary exploration of polyploids. Oxford University Press 2020-03-27 /pmc/articles/PMC7986634/ /pubmed/32219306 http://dx.doi.org/10.1093/bib/bbaa035 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Method Review Hu, Guanjing Grover, Corrinne E Arick, Mark A Liu, Meiling Peterson, Daniel G Wendel, Jonathan F Homoeologous gene expression and co-expression network analyses and evolutionary inference in allopolyploids |
title | Homoeologous gene expression and co-expression network analyses and evolutionary inference in allopolyploids |
title_full | Homoeologous gene expression and co-expression network analyses and evolutionary inference in allopolyploids |
title_fullStr | Homoeologous gene expression and co-expression network analyses and evolutionary inference in allopolyploids |
title_full_unstemmed | Homoeologous gene expression and co-expression network analyses and evolutionary inference in allopolyploids |
title_short | Homoeologous gene expression and co-expression network analyses and evolutionary inference in allopolyploids |
title_sort | homoeologous gene expression and co-expression network analyses and evolutionary inference in allopolyploids |
topic | Method Review |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7986634/ https://www.ncbi.nlm.nih.gov/pubmed/32219306 http://dx.doi.org/10.1093/bib/bbaa035 |
work_keys_str_mv | AT huguanjing homoeologousgeneexpressionandcoexpressionnetworkanalysesandevolutionaryinferenceinallopolyploids AT grovercorrinnee homoeologousgeneexpressionandcoexpressionnetworkanalysesandevolutionaryinferenceinallopolyploids AT arickmarka homoeologousgeneexpressionandcoexpressionnetworkanalysesandevolutionaryinferenceinallopolyploids AT liumeiling homoeologousgeneexpressionandcoexpressionnetworkanalysesandevolutionaryinferenceinallopolyploids AT petersondanielg homoeologousgeneexpressionandcoexpressionnetworkanalysesandevolutionaryinferenceinallopolyploids AT wendeljonathanf homoeologousgeneexpressionandcoexpressionnetworkanalysesandevolutionaryinferenceinallopolyploids |