Cargando…

Disentangling homeologous contigs in allo-tetraploid assembly: application to durum wheat

BACKGROUND: Using Next Generation Sequencing, SNP discovery is relatively easy on diploid species and still hampered in polyploid species by the confusion due to homeology. We develop HomeoSplitter; a fast and effective solution to split original contigs obtained by RNAseq into two homeologous seque...

Descripción completa

Detalles Bibliográficos
Autores principales: Ranwez, Vincent, Holtz, Yan, Sarah, Gautier, Ardisson, Morgane, Santoni, Sylvain, Glémin, Sylvain, Tavaud-Pirra, Muriel, David, Jacques
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3851826/
https://www.ncbi.nlm.nih.gov/pubmed/24564644
http://dx.doi.org/10.1186/1471-2105-14-S15-S15
_version_ 1782294359515856896
author Ranwez, Vincent
Holtz, Yan
Sarah, Gautier
Ardisson, Morgane
Santoni, Sylvain
Glémin, Sylvain
Tavaud-Pirra, Muriel
David, Jacques
author_facet Ranwez, Vincent
Holtz, Yan
Sarah, Gautier
Ardisson, Morgane
Santoni, Sylvain
Glémin, Sylvain
Tavaud-Pirra, Muriel
David, Jacques
author_sort Ranwez, Vincent
collection PubMed
description BACKGROUND: Using Next Generation Sequencing, SNP discovery is relatively easy on diploid species and still hampered in polyploid species by the confusion due to homeology. We develop HomeoSplitter; a fast and effective solution to split original contigs obtained by RNAseq into two homeologous sequences. It uses the differential expression of the two homeologous genes in the RNA. We verify that the new sequences are closer to the diploid progenitors of the allopolyploid species than the original contig. By remapping original reads on these new sequences, we also verify that the number of valuable detected SNPs has significantly increased. Thirty accessions of the tetraploid durum wheat (Triticum turgidum, A and B genomes) were sequenced in pooled cDNA libraries. Reads were assembled in a de novo durum assembly. Transcriptomes of the diploid species, Aegilops speltoides (close B genome) and Triticum urartu (A genome) were used as reference to benchmark the method. RESULTS: HomeoSplitter is a fast and effective solution to disentangle homeologous sequences based on a maximum likelihood optimization. On a benchmark set of 2,505 clusters containing homologous sequences of urartu, speltoides and durum, HomeoSplitter was efficient to build sequences closer to the diploid references and increased the number of valuable SNPs from 188 out of 1,360 SNPs detected when mapping the reads on the de novo durum assembly to 762 out of 1,620 SNPs when mapping on HomeoSplitter contigs. CONCLUSIONS: The HomeoSplitter program is freely available at http://bioweb.supagro.inra.fr/homeoSplitter/. This work provides a practical solution to the complex problem of disentangling homeologous transcripts in allo-tetraploids, which further allows an improved SNP detection.
format Online
Article
Text
id pubmed-3851826
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-38518262013-12-20 Disentangling homeologous contigs in allo-tetraploid assembly: application to durum wheat Ranwez, Vincent Holtz, Yan Sarah, Gautier Ardisson, Morgane Santoni, Sylvain Glémin, Sylvain Tavaud-Pirra, Muriel David, Jacques BMC Bioinformatics Proceedings BACKGROUND: Using Next Generation Sequencing, SNP discovery is relatively easy on diploid species and still hampered in polyploid species by the confusion due to homeology. We develop HomeoSplitter; a fast and effective solution to split original contigs obtained by RNAseq into two homeologous sequences. It uses the differential expression of the two homeologous genes in the RNA. We verify that the new sequences are closer to the diploid progenitors of the allopolyploid species than the original contig. By remapping original reads on these new sequences, we also verify that the number of valuable detected SNPs has significantly increased. Thirty accessions of the tetraploid durum wheat (Triticum turgidum, A and B genomes) were sequenced in pooled cDNA libraries. Reads were assembled in a de novo durum assembly. Transcriptomes of the diploid species, Aegilops speltoides (close B genome) and Triticum urartu (A genome) were used as reference to benchmark the method. RESULTS: HomeoSplitter is a fast and effective solution to disentangle homeologous sequences based on a maximum likelihood optimization. On a benchmark set of 2,505 clusters containing homologous sequences of urartu, speltoides and durum, HomeoSplitter was efficient to build sequences closer to the diploid references and increased the number of valuable SNPs from 188 out of 1,360 SNPs detected when mapping the reads on the de novo durum assembly to 762 out of 1,620 SNPs when mapping on HomeoSplitter contigs. CONCLUSIONS: The HomeoSplitter program is freely available at http://bioweb.supagro.inra.fr/homeoSplitter/. This work provides a practical solution to the complex problem of disentangling homeologous transcripts in allo-tetraploids, which further allows an improved SNP detection. BioMed Central 2013-10-15 /pmc/articles/PMC3851826/ /pubmed/24564644 http://dx.doi.org/10.1186/1471-2105-14-S15-S15 Text en Copyright © 2013 Ranwez et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Ranwez, Vincent
Holtz, Yan
Sarah, Gautier
Ardisson, Morgane
Santoni, Sylvain
Glémin, Sylvain
Tavaud-Pirra, Muriel
David, Jacques
Disentangling homeologous contigs in allo-tetraploid assembly: application to durum wheat
title Disentangling homeologous contigs in allo-tetraploid assembly: application to durum wheat
title_full Disentangling homeologous contigs in allo-tetraploid assembly: application to durum wheat
title_fullStr Disentangling homeologous contigs in allo-tetraploid assembly: application to durum wheat
title_full_unstemmed Disentangling homeologous contigs in allo-tetraploid assembly: application to durum wheat
title_short Disentangling homeologous contigs in allo-tetraploid assembly: application to durum wheat
title_sort disentangling homeologous contigs in allo-tetraploid assembly: application to durum wheat
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3851826/
https://www.ncbi.nlm.nih.gov/pubmed/24564644
http://dx.doi.org/10.1186/1471-2105-14-S15-S15
work_keys_str_mv AT ranwezvincent disentanglinghomeologouscontigsinallotetraploidassemblyapplicationtodurumwheat
AT holtzyan disentanglinghomeologouscontigsinallotetraploidassemblyapplicationtodurumwheat
AT sarahgautier disentanglinghomeologouscontigsinallotetraploidassemblyapplicationtodurumwheat
AT ardissonmorgane disentanglinghomeologouscontigsinallotetraploidassemblyapplicationtodurumwheat
AT santonisylvain disentanglinghomeologouscontigsinallotetraploidassemblyapplicationtodurumwheat
AT gleminsylvain disentanglinghomeologouscontigsinallotetraploidassemblyapplicationtodurumwheat
AT tavaudpirramuriel disentanglinghomeologouscontigsinallotetraploidassemblyapplicationtodurumwheat
AT davidjacques disentanglinghomeologouscontigsinallotetraploidassemblyapplicationtodurumwheat