Cargando…

Comparative analyses of two Geraniaceae transcriptomes using next-generation sequencing

BACKGROUND: Organelle genomes of Geraniaceae exhibit several unusual evolutionary phenomena compared to other angiosperm families including accelerated nucleotide substitution rates, widespread gene loss, reduced RNA editing, and extensive genomic rearrangements. Since most organelle-encoded protein...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Jin, Ruhlman, Tracey A, Mower, Jeffrey P, Jansen, Robert K
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3880972/
https://www.ncbi.nlm.nih.gov/pubmed/24373163
http://dx.doi.org/10.1186/1471-2229-13-228
_version_ 1782298135245094912
author Zhang, Jin
Ruhlman, Tracey A
Mower, Jeffrey P
Jansen, Robert K
author_facet Zhang, Jin
Ruhlman, Tracey A
Mower, Jeffrey P
Jansen, Robert K
author_sort Zhang, Jin
collection PubMed
description BACKGROUND: Organelle genomes of Geraniaceae exhibit several unusual evolutionary phenomena compared to other angiosperm families including accelerated nucleotide substitution rates, widespread gene loss, reduced RNA editing, and extensive genomic rearrangements. Since most organelle-encoded proteins function in multi-subunit complexes that also contain nuclear-encoded proteins, it is likely that the atypical organellar phenomena affect the evolution of nuclear genes encoding organellar proteins. To begin to unravel the complex co-evolutionary interplay between organellar and nuclear genomes in this family, we sequenced nuclear transcriptomes of two species, Geranium maderense and Pelargonium x hortorum. RESULTS: Normalized cDNA libraries of G. maderense and P. x hortorum were used for transcriptome sequencing. Five assemblers (MIRA, Newbler, SOAPdenovo, SOAPdenovo-trans [SOAPtrans], Trinity) and two next-generation technologies (454 and Illumina) were compared to determine the optimal transcriptome sequencing approach. Trinity provided the highest quality assembly of Illumina data with the deepest transcriptome coverage. An analysis to determine the amount of sequencing needed for de novo assembly revealed diminishing returns of coverage and quality with data sets larger than sixty million Illumina paired end reads for both species. The G. maderense and P. x hortorum transcriptomes contained fewer transcripts encoding the PLS subclass of PPR proteins relative to other angiosperms, consistent with reduced mitochondrial RNA editing activity in Geraniaceae. In addition, transcripts for all six plastid targeted sigma factors were identified in both transcriptomes, suggesting that one of the highly divergent rpoA-like ORFs in the P. x hortorum plastid genome is functional. CONCLUSIONS: The findings support the use of the Illumina platform and assemblers optimized for transcriptome assembly, such as Trinity or SOAPtrans, to generate high-quality de novo transcriptomes with broad coverage. In addition, results indicated no major improvements in breadth of coverage with data sets larger than six billion nucleotides or when sampling RNA from four tissue types rather than from a single tissue. Finally, this work demonstrates the power of cross-compartmental genomic analyses to deepen our understanding of the correlated evolution of the nuclear, plastid, and mitochondrial genomes in plants.
format Online
Article
Text
id pubmed-3880972
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-38809722014-01-07 Comparative analyses of two Geraniaceae transcriptomes using next-generation sequencing Zhang, Jin Ruhlman, Tracey A Mower, Jeffrey P Jansen, Robert K BMC Plant Biol Research Article BACKGROUND: Organelle genomes of Geraniaceae exhibit several unusual evolutionary phenomena compared to other angiosperm families including accelerated nucleotide substitution rates, widespread gene loss, reduced RNA editing, and extensive genomic rearrangements. Since most organelle-encoded proteins function in multi-subunit complexes that also contain nuclear-encoded proteins, it is likely that the atypical organellar phenomena affect the evolution of nuclear genes encoding organellar proteins. To begin to unravel the complex co-evolutionary interplay between organellar and nuclear genomes in this family, we sequenced nuclear transcriptomes of two species, Geranium maderense and Pelargonium x hortorum. RESULTS: Normalized cDNA libraries of G. maderense and P. x hortorum were used for transcriptome sequencing. Five assemblers (MIRA, Newbler, SOAPdenovo, SOAPdenovo-trans [SOAPtrans], Trinity) and two next-generation technologies (454 and Illumina) were compared to determine the optimal transcriptome sequencing approach. Trinity provided the highest quality assembly of Illumina data with the deepest transcriptome coverage. An analysis to determine the amount of sequencing needed for de novo assembly revealed diminishing returns of coverage and quality with data sets larger than sixty million Illumina paired end reads for both species. The G. maderense and P. x hortorum transcriptomes contained fewer transcripts encoding the PLS subclass of PPR proteins relative to other angiosperms, consistent with reduced mitochondrial RNA editing activity in Geraniaceae. In addition, transcripts for all six plastid targeted sigma factors were identified in both transcriptomes, suggesting that one of the highly divergent rpoA-like ORFs in the P. x hortorum plastid genome is functional. CONCLUSIONS: The findings support the use of the Illumina platform and assemblers optimized for transcriptome assembly, such as Trinity or SOAPtrans, to generate high-quality de novo transcriptomes with broad coverage. In addition, results indicated no major improvements in breadth of coverage with data sets larger than six billion nucleotides or when sampling RNA from four tissue types rather than from a single tissue. Finally, this work demonstrates the power of cross-compartmental genomic analyses to deepen our understanding of the correlated evolution of the nuclear, plastid, and mitochondrial genomes in plants. BioMed Central 2013-12-29 /pmc/articles/PMC3880972/ /pubmed/24373163 http://dx.doi.org/10.1186/1471-2229-13-228 Text en Copyright © 2013 Zhang et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Zhang, Jin
Ruhlman, Tracey A
Mower, Jeffrey P
Jansen, Robert K
Comparative analyses of two Geraniaceae transcriptomes using next-generation sequencing
title Comparative analyses of two Geraniaceae transcriptomes using next-generation sequencing
title_full Comparative analyses of two Geraniaceae transcriptomes using next-generation sequencing
title_fullStr Comparative analyses of two Geraniaceae transcriptomes using next-generation sequencing
title_full_unstemmed Comparative analyses of two Geraniaceae transcriptomes using next-generation sequencing
title_short Comparative analyses of two Geraniaceae transcriptomes using next-generation sequencing
title_sort comparative analyses of two geraniaceae transcriptomes using next-generation sequencing
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3880972/
https://www.ncbi.nlm.nih.gov/pubmed/24373163
http://dx.doi.org/10.1186/1471-2229-13-228
work_keys_str_mv AT zhangjin comparativeanalysesoftwogeraniaceaetranscriptomesusingnextgenerationsequencing
AT ruhlmantraceya comparativeanalysesoftwogeraniaceaetranscriptomesusingnextgenerationsequencing
AT mowerjeffreyp comparativeanalysesoftwogeraniaceaetranscriptomesusingnextgenerationsequencing
AT jansenrobertk comparativeanalysesoftwogeraniaceaetranscriptomesusingnextgenerationsequencing