
Improving transcriptome de novo assembly by using a reference genome of a related species: Translational genomics from oil palm to coconut

The palms are a family of tropical origin and one of the main constituents of the ecosystems of these regions around the world. The two main species of palm represent different challenges: coconut (Cocos nucifera L.) is a source of multiple goods and services in tropical communities, while oil palm...


Bibliographic Details
Main authors: Armero, Alix, Baudouin, Luc, Bocs, Stéphanie, This, Dominique
Format: Online Article Text
Language: English
Published: Public Library of Science 2017
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5363918/
https://www.ncbi.nlm.nih.gov/pubmed/28334050
http://dx.doi.org/10.1371/journal.pone.0173300
_version_ 1782517233801494528
author Armero, Alix
Baudouin, Luc
Bocs, Stéphanie
This, Dominique
author_facet Armero, Alix
Baudouin, Luc
Bocs, Stéphanie
This, Dominique
author_sort Armero, Alix
collection PubMed
description The palms are a family of tropical origin and one of the main constituents of the ecosystems of these regions around the world. The two main species of palm represent different challenges: coconut (Cocos nucifera L.) is a source of multiple goods and services in tropical communities, while oil palm (Elaeis guineensis Jacq) is the main protagonist of the oil market. In this study, we present a workflow that exploits the comparative genomics between a target species (coconut) and a reference species (oil palm) to improve the transcriptomic data, providing a proteome useful for answering functional or evolutionary questions. This workflow reduces redundancy and fragmentation, two inherent problems of transcriptomic data, while preserving the functional representation of the target species. Our approach was validated in Arabidopsis thaliana using Arabidopsis lyrata and Capsella rubella as reference species. This analysis showed the high sensitivity and specificity of our strategy, which were relatively independent of the reference proteome. The workflow increased the length of protein products in A. thaliana by 13%, often allowing recovery of 100% of the protein sequence length. In addition, redundancy was reduced by a factor greater than 3. In coconut, the approach generated 29,366 proteins, 1,246 of which derived from new contigs obtained with the BRANCH software. The coconut proteome presented a functional profile similar to that observed in rice and a large number of metabolic pathways related to secondary metabolism. The new sequences found with the BRANCH software were enriched in functions related to biotic stress. Our strategy can be used as a complementary step to de novo transcriptome assembly to obtain a representative proteome of a target species. The results of the current analysis are available on the website PalmComparomics (http://palm-comparomics.southgreen.fr/).
format Online
Article
Text
id pubmed-5363918
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-5363918 2017-04-06 Improving transcriptome de novo assembly by using a reference genome of a related species: Translational genomics from oil palm to coconut Armero, Alix Baudouin, Luc Bocs, Stéphanie This, Dominique PLoS One Research Article Public Library of Science 2017-03-23 /pmc/articles/PMC5363918/ /pubmed/28334050 http://dx.doi.org/10.1371/journal.pone.0173300 Text en © 2017 Armero et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Armero, Alix
Baudouin, Luc
Bocs, Stéphanie
This, Dominique
Improving transcriptome de novo assembly by using a reference genome of a related species: Translational genomics from oil palm to coconut
title Improving transcriptome de novo assembly by using a reference genome of a related species: Translational genomics from oil palm to coconut
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5363918/
https://www.ncbi.nlm.nih.gov/pubmed/28334050
http://dx.doi.org/10.1371/journal.pone.0173300