Cargando…

Homeolog expression quantification methods for allopolyploids

Genome duplication with hybridization, or allopolyploidization, occurs in animals, fungi and plants, and is especially common in crop plants. There is an increasing interest in the study of allopolyploids because of advances in polyploid genome assembly; however, the high level of sequence similarit...

Descripción completa

Detalles Bibliográficos
Autores principales: Kuo, Tony C Y, Hatakeyama, Masaomi, Tameshige, Toshiaki, Shimizu, Kentaro K, Sese, Jun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7299288/
https://www.ncbi.nlm.nih.gov/pubmed/30590436
http://dx.doi.org/10.1093/bib/bby121
_version_ 1783547356592996352
author Kuo, Tony C Y
Hatakeyama, Masaomi
Tameshige, Toshiaki
Shimizu, Kentaro K
Sese, Jun
author_facet Kuo, Tony C Y
Hatakeyama, Masaomi
Tameshige, Toshiaki
Shimizu, Kentaro K
Sese, Jun
author_sort Kuo, Tony C Y
collection PubMed
description Genome duplication with hybridization, or allopolyploidization, occurs in animals, fungi and plants, and is especially common in crop plants. There is an increasing interest in the study of allopolyploids because of advances in polyploid genome assembly; however, the high level of sequence similarity in duplicated gene copies (homeologs) poses many challenges. Here we compared standard RNA-seq expression quantification approaches used currently for diploid species against subgenome-classification approaches which maps reads to each subgenome separately. We examined mapping error using our previous and new RNA-seq data in which a subgenome is experimentally added (synthetic allotetraploid Arabidopsis kamchatica) or reduced (allohexaploid wheat Triticum aestivum versus extracted allotetraploid) as ground truth. The error rates in the two species were very similar. The standard approaches showed higher error rates (>10% using pseudo-alignment with Kallisto) while subgenome-classification approaches showed much lower error rates (<1% using EAGLE-RC, <2% using HomeoRoq). Although downstream analysis may partly mitigate mapping errors, the difference in methods was substantial in hexaploid wheat, where Kallisto appeared to have systematic differences relative to other methods. Only approximately half of the differentially expressed homeologs detected using Kallisto overlapped with those by any other method in wheat. In general, disagreement in low-expression genes was responsible for most of the discordance between methods, which is consistent with known biases in Kallisto. We also observed that there exist uncertainties in genome sequences and annotation which can affect each method differently. Overall, subgenome-classification approaches tend to perform better than standard approaches with EAGLE-RC having the highest precision.
format Online
Article
Text
id pubmed-7299288
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-72992882020-06-22 Homeolog expression quantification methods for allopolyploids Kuo, Tony C Y Hatakeyama, Masaomi Tameshige, Toshiaki Shimizu, Kentaro K Sese, Jun Brief Bioinform Review Article Genome duplication with hybridization, or allopolyploidization, occurs in animals, fungi and plants, and is especially common in crop plants. There is an increasing interest in the study of allopolyploids because of advances in polyploid genome assembly; however, the high level of sequence similarity in duplicated gene copies (homeologs) poses many challenges. Here we compared standard RNA-seq expression quantification approaches used currently for diploid species against subgenome-classification approaches which maps reads to each subgenome separately. We examined mapping error using our previous and new RNA-seq data in which a subgenome is experimentally added (synthetic allotetraploid Arabidopsis kamchatica) or reduced (allohexaploid wheat Triticum aestivum versus extracted allotetraploid) as ground truth. The error rates in the two species were very similar. The standard approaches showed higher error rates (>10% using pseudo-alignment with Kallisto) while subgenome-classification approaches showed much lower error rates (<1% using EAGLE-RC, <2% using HomeoRoq). Although downstream analysis may partly mitigate mapping errors, the difference in methods was substantial in hexaploid wheat, where Kallisto appeared to have systematic differences relative to other methods. Only approximately half of the differentially expressed homeologs detected using Kallisto overlapped with those by any other method in wheat. In general, disagreement in low-expression genes was responsible for most of the discordance between methods, which is consistent with known biases in Kallisto. We also observed that there exist uncertainties in genome sequences and annotation which can affect each method differently. Overall, subgenome-classification approaches tend to perform better than standard approaches with EAGLE-RC having the highest precision. Oxford University Press 2018-12-27 /pmc/articles/PMC7299288/ /pubmed/30590436 http://dx.doi.org/10.1093/bib/bby121 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Review Article
Kuo, Tony C Y
Hatakeyama, Masaomi
Tameshige, Toshiaki
Shimizu, Kentaro K
Sese, Jun
Homeolog expression quantification methods for allopolyploids
title Homeolog expression quantification methods for allopolyploids
title_full Homeolog expression quantification methods for allopolyploids
title_fullStr Homeolog expression quantification methods for allopolyploids
title_full_unstemmed Homeolog expression quantification methods for allopolyploids
title_short Homeolog expression quantification methods for allopolyploids
title_sort homeolog expression quantification methods for allopolyploids
topic Review Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7299288/
https://www.ncbi.nlm.nih.gov/pubmed/30590436
http://dx.doi.org/10.1093/bib/bby121
work_keys_str_mv AT kuotonycy homeologexpressionquantificationmethodsforallopolyploids
AT hatakeyamamasaomi homeologexpressionquantificationmethodsforallopolyploids
AT tameshigetoshiaki homeologexpressionquantificationmethodsforallopolyploids
AT shimizukentarok homeologexpressionquantificationmethodsforallopolyploids
AT sesejun homeologexpressionquantificationmethodsforallopolyploids