Cargando…
Comparing de novo transcriptome assembly tools in di- and autotetraploid non-model plant species
BACKGROUND: Polyploidy is very common in plants and can be seen as one of the key drivers in the domestication of crops and the establishment of important agronomic traits. It can be the main source of genomic repatterning and introduces gene duplications, affecting gene expression and alternative s...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7986043/ https://www.ncbi.nlm.nih.gov/pubmed/33752598 http://dx.doi.org/10.1186/s12859-021-04078-8 |
_version_ | 1783668363859329024 |
---|---|
author | Madritsch, Silvia Burg, Agnes Sehr, Eva M. |
author_facet | Madritsch, Silvia Burg, Agnes Sehr, Eva M. |
author_sort | Madritsch, Silvia |
collection | PubMed |
description | BACKGROUND: Polyploidy is very common in plants and can be seen as one of the key drivers in the domestication of crops and the establishment of important agronomic traits. It can be the main source of genomic repatterning and introduces gene duplications, affecting gene expression and alternative splicing. Since fully sequenced genomes are not yet available for many plant species including crops, de novo transcriptome assembly is the basis to understand molecular and functional mechanisms. However, in complex polyploid plants, de novo transcriptome assembly is challenging, leading to increased rates of fused or redundant transcripts. Since assemblers were developed mainly for diploid organisms, they may not well suited for polyploids. Also, comparative evaluations of these tools on higher polyploid plants are extremely rare. Thus, our aim was to fill this gap and to provide a basic guideline for choosing the optimal de novo assembly strategy focusing on autotetraploids, as the scientific interest in this type of polyploidy is steadily increasing. RESULTS: We present a comparison of two common (SOAPdenovo-Trans, Trinity) and one recently published transcriptome assembler (TransLiG) on diploid and autotetraploid species of the genera Acer and Vaccinium using Arabidopsis thaliana as a reference. The number of assembled transcripts was up to 11 and 14 times higher with an increased number of short transcripts for Acer and Vaccinium, respectively, compared to A. thaliana. In diploid samples, Trinity and TransLiG performed similarly good while in autotetraploids, TransLiG assembled most complete transcriptomes with an average of 1916 assembled BUSCOs vs. 1705 BUSCOs for Trinity. Of all three assemblers, SOAPdenovo-Trans performed worst (1133 complete BUSCOs). CONCLUSION: All three assembly tools produced complete assemblies when dealing with the model organism A. thaliana, independently of its ploidy level, but their performances differed extremely when it comes to non-model autotetraploids, where specifically TransLiG and Trinity produced a high number of redundant transcripts. The recently published assembler TransLiG has not been tested yet on any plant organism but showed highest completeness and full-length transcriptomes, especially in autotetraploids. Including such species during the development and testing of new assembly tools is highly appreciated and recommended as many important crops are polyploid. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04078-8. |
format | Online Article Text |
id | pubmed-7986043 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-79860432021-03-24 Comparing de novo transcriptome assembly tools in di- and autotetraploid non-model plant species Madritsch, Silvia Burg, Agnes Sehr, Eva M. BMC Bioinformatics Research Article BACKGROUND: Polyploidy is very common in plants and can be seen as one of the key drivers in the domestication of crops and the establishment of important agronomic traits. It can be the main source of genomic repatterning and introduces gene duplications, affecting gene expression and alternative splicing. Since fully sequenced genomes are not yet available for many plant species including crops, de novo transcriptome assembly is the basis to understand molecular and functional mechanisms. However, in complex polyploid plants, de novo transcriptome assembly is challenging, leading to increased rates of fused or redundant transcripts. Since assemblers were developed mainly for diploid organisms, they may not well suited for polyploids. Also, comparative evaluations of these tools on higher polyploid plants are extremely rare. Thus, our aim was to fill this gap and to provide a basic guideline for choosing the optimal de novo assembly strategy focusing on autotetraploids, as the scientific interest in this type of polyploidy is steadily increasing. RESULTS: We present a comparison of two common (SOAPdenovo-Trans, Trinity) and one recently published transcriptome assembler (TransLiG) on diploid and autotetraploid species of the genera Acer and Vaccinium using Arabidopsis thaliana as a reference. The number of assembled transcripts was up to 11 and 14 times higher with an increased number of short transcripts for Acer and Vaccinium, respectively, compared to A. thaliana. In diploid samples, Trinity and TransLiG performed similarly good while in autotetraploids, TransLiG assembled most complete transcriptomes with an average of 1916 assembled BUSCOs vs. 1705 BUSCOs for Trinity. Of all three assemblers, SOAPdenovo-Trans performed worst (1133 complete BUSCOs). CONCLUSION: All three assembly tools produced complete assemblies when dealing with the model organism A. thaliana, independently of its ploidy level, but their performances differed extremely when it comes to non-model autotetraploids, where specifically TransLiG and Trinity produced a high number of redundant transcripts. The recently published assembler TransLiG has not been tested yet on any plant organism but showed highest completeness and full-length transcriptomes, especially in autotetraploids. Including such species during the development and testing of new assembly tools is highly appreciated and recommended as many important crops are polyploid. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04078-8. BioMed Central 2021-03-22 /pmc/articles/PMC7986043/ /pubmed/33752598 http://dx.doi.org/10.1186/s12859-021-04078-8 Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Madritsch, Silvia Burg, Agnes Sehr, Eva M. Comparing de novo transcriptome assembly tools in di- and autotetraploid non-model plant species |
title | Comparing de novo transcriptome assembly tools in di- and autotetraploid non-model plant species |
title_full | Comparing de novo transcriptome assembly tools in di- and autotetraploid non-model plant species |
title_fullStr | Comparing de novo transcriptome assembly tools in di- and autotetraploid non-model plant species |
title_full_unstemmed | Comparing de novo transcriptome assembly tools in di- and autotetraploid non-model plant species |
title_short | Comparing de novo transcriptome assembly tools in di- and autotetraploid non-model plant species |
title_sort | comparing de novo transcriptome assembly tools in di- and autotetraploid non-model plant species |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7986043/ https://www.ncbi.nlm.nih.gov/pubmed/33752598 http://dx.doi.org/10.1186/s12859-021-04078-8 |
work_keys_str_mv | AT madritschsilvia comparingdenovotranscriptomeassemblytoolsindiandautotetraploidnonmodelplantspecies AT burgagnes comparingdenovotranscriptomeassemblytoolsindiandautotetraploidnonmodelplantspecies AT sehrevam comparingdenovotranscriptomeassemblytoolsindiandautotetraploidnonmodelplantspecies |