Cargando…

Construction of a de novo assembly pipeline using multiple transcriptome data sets from Cypripedium macranthos (Orchidaceae)

The family Orchidaceae comprises the most species of any monocotyledonous family and has interesting characteristics such as seed germination induced by mycorrhizal fungi and flower morphology that co-adapted with pollinators. In orchid species, genomes have been decoded for only a few horticultural...

Descripción completa

Detalles Bibliográficos
Autores principales: Kambara, Kota, Fujino, Kaien, Shimura, Hanako
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10243633/
https://www.ncbi.nlm.nih.gov/pubmed/37279244
http://dx.doi.org/10.1371/journal.pone.0286804
_version_ 1785054463574147072
author Kambara, Kota
Fujino, Kaien
Shimura, Hanako
author_facet Kambara, Kota
Fujino, Kaien
Shimura, Hanako
author_sort Kambara, Kota
collection PubMed
description The family Orchidaceae comprises the most species of any monocotyledonous family and has interesting characteristics such as seed germination induced by mycorrhizal fungi and flower morphology that co-adapted with pollinators. In orchid species, genomes have been decoded for only a few horticultural species, and there is little genetic information available. Generally, for species lacking sequenced genomes, gene sequences are predicted by de novo assembly of transcriptome data. Here, we devised a de novo assembly pipeline for transcriptome data from the wild orchid Cypripedium (lady slipper orchid) in Japan by mixing multiple data sets and integrating assemblies to create a more complete and less redundant contig set. Among the assemblies generated by combining various assemblers, Trinity and IDBA-Tran yielded good assembly with higher mapping rates and percentages of BLAST hit contigs and complete BUSCO. Using this contig set as a reference, we analyzed differential gene expression between protocorms grown aseptically or with mycorrhizal fungi to detect gene expressions required for mycorrhizal interaction. A pipeline proposed in this study can construct a highly reliable contig set with little redundancy even when multiple transcriptome data are mixed, and can provide a reference that is adaptable to DEG analysis and other downstream analysis in RNA-seq.
format Online
Article
Text
id pubmed-10243633
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-102436332023-06-07 Construction of a de novo assembly pipeline using multiple transcriptome data sets from Cypripedium macranthos (Orchidaceae) Kambara, Kota Fujino, Kaien Shimura, Hanako PLoS One Research Article The family Orchidaceae comprises the most species of any monocotyledonous family and has interesting characteristics such as seed germination induced by mycorrhizal fungi and flower morphology that co-adapted with pollinators. In orchid species, genomes have been decoded for only a few horticultural species, and there is little genetic information available. Generally, for species lacking sequenced genomes, gene sequences are predicted by de novo assembly of transcriptome data. Here, we devised a de novo assembly pipeline for transcriptome data from the wild orchid Cypripedium (lady slipper orchid) in Japan by mixing multiple data sets and integrating assemblies to create a more complete and less redundant contig set. Among the assemblies generated by combining various assemblers, Trinity and IDBA-Tran yielded good assembly with higher mapping rates and percentages of BLAST hit contigs and complete BUSCO. Using this contig set as a reference, we analyzed differential gene expression between protocorms grown aseptically or with mycorrhizal fungi to detect gene expressions required for mycorrhizal interaction. A pipeline proposed in this study can construct a highly reliable contig set with little redundancy even when multiple transcriptome data are mixed, and can provide a reference that is adaptable to DEG analysis and other downstream analysis in RNA-seq. Public Library of Science 2023-06-06 /pmc/articles/PMC10243633/ /pubmed/37279244 http://dx.doi.org/10.1371/journal.pone.0286804 Text en © 2023 Kambara et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Kambara, Kota
Fujino, Kaien
Shimura, Hanako
Construction of a de novo assembly pipeline using multiple transcriptome data sets from Cypripedium macranthos (Orchidaceae)
title Construction of a de novo assembly pipeline using multiple transcriptome data sets from Cypripedium macranthos (Orchidaceae)
title_full Construction of a de novo assembly pipeline using multiple transcriptome data sets from Cypripedium macranthos (Orchidaceae)
title_fullStr Construction of a de novo assembly pipeline using multiple transcriptome data sets from Cypripedium macranthos (Orchidaceae)
title_full_unstemmed Construction of a de novo assembly pipeline using multiple transcriptome data sets from Cypripedium macranthos (Orchidaceae)
title_short Construction of a de novo assembly pipeline using multiple transcriptome data sets from Cypripedium macranthos (Orchidaceae)
title_sort construction of a de novo assembly pipeline using multiple transcriptome data sets from cypripedium macranthos (orchidaceae)
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10243633/
https://www.ncbi.nlm.nih.gov/pubmed/37279244
http://dx.doi.org/10.1371/journal.pone.0286804
work_keys_str_mv AT kambarakota constructionofadenovoassemblypipelineusingmultipletranscriptomedatasetsfromcypripediummacranthosorchidaceae
AT fujinokaien constructionofadenovoassemblypipelineusingmultipletranscriptomedatasetsfromcypripediummacranthosorchidaceae
AT shimurahanako constructionofadenovoassemblypipelineusingmultipletranscriptomedatasetsfromcypripediummacranthosorchidaceae