Cargando…

Comparing de novo transcriptome assembly tools in di- and autotetraploid non-model plant species

BACKGROUND: Polyploidy is very common in plants and can be seen as one of the key drivers in the domestication of crops and the establishment of important agronomic traits. It can be the main source of genomic repatterning and introduces gene duplications, affecting gene expression and alternative s...

Descripción completa

Detalles Bibliográficos
Autores principales: Madritsch, Silvia, Burg, Agnes, Sehr, Eva M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7986043/
https://www.ncbi.nlm.nih.gov/pubmed/33752598
http://dx.doi.org/10.1186/s12859-021-04078-8
_version_ 1783668363859329024
author Madritsch, Silvia
Burg, Agnes
Sehr, Eva M.
author_facet Madritsch, Silvia
Burg, Agnes
Sehr, Eva M.
author_sort Madritsch, Silvia
collection PubMed
description BACKGROUND: Polyploidy is very common in plants and can be seen as one of the key drivers in the domestication of crops and the establishment of important agronomic traits. It can be the main source of genomic repatterning and introduces gene duplications, affecting gene expression and alternative splicing. Since fully sequenced genomes are not yet available for many plant species including crops, de novo transcriptome assembly is the basis to understand molecular and functional mechanisms. However, in complex polyploid plants, de novo transcriptome assembly is challenging, leading to increased rates of fused or redundant transcripts. Since assemblers were developed mainly for diploid organisms, they may not well suited for polyploids. Also, comparative evaluations of these tools on higher polyploid plants are extremely rare. Thus, our aim was to fill this gap and to provide a basic guideline for choosing the optimal de novo assembly strategy focusing on autotetraploids, as the scientific interest in this type of polyploidy is steadily increasing. RESULTS: We present a comparison of two common (SOAPdenovo-Trans, Trinity) and one recently published transcriptome assembler (TransLiG) on diploid and autotetraploid species of the genera Acer and Vaccinium using Arabidopsis thaliana as a reference. The number of assembled transcripts was up to 11 and 14 times higher with an increased number of short transcripts for Acer and Vaccinium, respectively, compared to A. thaliana. In diploid samples, Trinity and TransLiG performed similarly good while in autotetraploids, TransLiG assembled most complete transcriptomes with an average of 1916 assembled BUSCOs vs. 1705 BUSCOs for Trinity. Of all three assemblers, SOAPdenovo-Trans performed worst (1133 complete BUSCOs). CONCLUSION: All three assembly tools produced complete assemblies when dealing with the model organism A. thaliana, independently of its ploidy level, but their performances differed extremely when it comes to non-model autotetraploids, where specifically TransLiG and Trinity produced a high number of redundant transcripts. The recently published assembler TransLiG has not been tested yet on any plant organism but showed highest completeness and full-length transcriptomes, especially in autotetraploids. Including such species during the development and testing of new assembly tools is highly appreciated and recommended as many important crops are polyploid. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04078-8.
format Online
Article
Text
id pubmed-7986043
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-79860432021-03-24 Comparing de novo transcriptome assembly tools in di- and autotetraploid non-model plant species Madritsch, Silvia Burg, Agnes Sehr, Eva M. BMC Bioinformatics Research Article BACKGROUND: Polyploidy is very common in plants and can be seen as one of the key drivers in the domestication of crops and the establishment of important agronomic traits. It can be the main source of genomic repatterning and introduces gene duplications, affecting gene expression and alternative splicing. Since fully sequenced genomes are not yet available for many plant species including crops, de novo transcriptome assembly is the basis to understand molecular and functional mechanisms. However, in complex polyploid plants, de novo transcriptome assembly is challenging, leading to increased rates of fused or redundant transcripts. Since assemblers were developed mainly for diploid organisms, they may not well suited for polyploids. Also, comparative evaluations of these tools on higher polyploid plants are extremely rare. Thus, our aim was to fill this gap and to provide a basic guideline for choosing the optimal de novo assembly strategy focusing on autotetraploids, as the scientific interest in this type of polyploidy is steadily increasing. RESULTS: We present a comparison of two common (SOAPdenovo-Trans, Trinity) and one recently published transcriptome assembler (TransLiG) on diploid and autotetraploid species of the genera Acer and Vaccinium using Arabidopsis thaliana as a reference. The number of assembled transcripts was up to 11 and 14 times higher with an increased number of short transcripts for Acer and Vaccinium, respectively, compared to A. thaliana. In diploid samples, Trinity and TransLiG performed similarly good while in autotetraploids, TransLiG assembled most complete transcriptomes with an average of 1916 assembled BUSCOs vs. 1705 BUSCOs for Trinity. Of all three assemblers, SOAPdenovo-Trans performed worst (1133 complete BUSCOs). CONCLUSION: All three assembly tools produced complete assemblies when dealing with the model organism A. thaliana, independently of its ploidy level, but their performances differed extremely when it comes to non-model autotetraploids, where specifically TransLiG and Trinity produced a high number of redundant transcripts. The recently published assembler TransLiG has not been tested yet on any plant organism but showed highest completeness and full-length transcriptomes, especially in autotetraploids. Including such species during the development and testing of new assembly tools is highly appreciated and recommended as many important crops are polyploid. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04078-8. BioMed Central 2021-03-22 /pmc/articles/PMC7986043/ /pubmed/33752598 http://dx.doi.org/10.1186/s12859-021-04078-8 Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Madritsch, Silvia
Burg, Agnes
Sehr, Eva M.
Comparing de novo transcriptome assembly tools in di- and autotetraploid non-model plant species
title Comparing de novo transcriptome assembly tools in di- and autotetraploid non-model plant species
title_full Comparing de novo transcriptome assembly tools in di- and autotetraploid non-model plant species
title_fullStr Comparing de novo transcriptome assembly tools in di- and autotetraploid non-model plant species
title_full_unstemmed Comparing de novo transcriptome assembly tools in di- and autotetraploid non-model plant species
title_short Comparing de novo transcriptome assembly tools in di- and autotetraploid non-model plant species
title_sort comparing de novo transcriptome assembly tools in di- and autotetraploid non-model plant species
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7986043/
https://www.ncbi.nlm.nih.gov/pubmed/33752598
http://dx.doi.org/10.1186/s12859-021-04078-8
work_keys_str_mv AT madritschsilvia comparingdenovotranscriptomeassemblytoolsindiandautotetraploidnonmodelplantspecies
AT burgagnes comparingdenovotranscriptomeassemblytoolsindiandautotetraploidnonmodelplantspecies
AT sehrevam comparingdenovotranscriptomeassemblytoolsindiandautotetraploidnonmodelplantspecies