Cargando…
Disentangling homeologous contigs in allo-tetraploid assembly: application to durum wheat
BACKGROUND: Using Next Generation Sequencing, SNP discovery is relatively easy on diploid species and still hampered in polyploid species by the confusion due to homeology. We develop HomeoSplitter; a fast and effective solution to split original contigs obtained by RNAseq into two homeologous seque...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3851826/ https://www.ncbi.nlm.nih.gov/pubmed/24564644 http://dx.doi.org/10.1186/1471-2105-14-S15-S15 |
_version_ | 1782294359515856896 |
---|---|
author | Ranwez, Vincent Holtz, Yan Sarah, Gautier Ardisson, Morgane Santoni, Sylvain Glémin, Sylvain Tavaud-Pirra, Muriel David, Jacques |
author_facet | Ranwez, Vincent Holtz, Yan Sarah, Gautier Ardisson, Morgane Santoni, Sylvain Glémin, Sylvain Tavaud-Pirra, Muriel David, Jacques |
author_sort | Ranwez, Vincent |
collection | PubMed |
description | BACKGROUND: Using Next Generation Sequencing, SNP discovery is relatively easy on diploid species and still hampered in polyploid species by the confusion due to homeology. We develop HomeoSplitter; a fast and effective solution to split original contigs obtained by RNAseq into two homeologous sequences. It uses the differential expression of the two homeologous genes in the RNA. We verify that the new sequences are closer to the diploid progenitors of the allopolyploid species than the original contig. By remapping original reads on these new sequences, we also verify that the number of valuable detected SNPs has significantly increased. Thirty accessions of the tetraploid durum wheat (Triticum turgidum, A and B genomes) were sequenced in pooled cDNA libraries. Reads were assembled in a de novo durum assembly. Transcriptomes of the diploid species, Aegilops speltoides (close B genome) and Triticum urartu (A genome) were used as reference to benchmark the method. RESULTS: HomeoSplitter is a fast and effective solution to disentangle homeologous sequences based on a maximum likelihood optimization. On a benchmark set of 2,505 clusters containing homologous sequences of urartu, speltoides and durum, HomeoSplitter was efficient to build sequences closer to the diploid references and increased the number of valuable SNPs from 188 out of 1,360 SNPs detected when mapping the reads on the de novo durum assembly to 762 out of 1,620 SNPs when mapping on HomeoSplitter contigs. CONCLUSIONS: The HomeoSplitter program is freely available at http://bioweb.supagro.inra.fr/homeoSplitter/. This work provides a practical solution to the complex problem of disentangling homeologous transcripts in allo-tetraploids, which further allows an improved SNP detection. |
format | Online Article Text |
id | pubmed-3851826 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-38518262013-12-20 Disentangling homeologous contigs in allo-tetraploid assembly: application to durum wheat Ranwez, Vincent Holtz, Yan Sarah, Gautier Ardisson, Morgane Santoni, Sylvain Glémin, Sylvain Tavaud-Pirra, Muriel David, Jacques BMC Bioinformatics Proceedings BACKGROUND: Using Next Generation Sequencing, SNP discovery is relatively easy on diploid species and still hampered in polyploid species by the confusion due to homeology. We develop HomeoSplitter; a fast and effective solution to split original contigs obtained by RNAseq into two homeologous sequences. It uses the differential expression of the two homeologous genes in the RNA. We verify that the new sequences are closer to the diploid progenitors of the allopolyploid species than the original contig. By remapping original reads on these new sequences, we also verify that the number of valuable detected SNPs has significantly increased. Thirty accessions of the tetraploid durum wheat (Triticum turgidum, A and B genomes) were sequenced in pooled cDNA libraries. Reads were assembled in a de novo durum assembly. Transcriptomes of the diploid species, Aegilops speltoides (close B genome) and Triticum urartu (A genome) were used as reference to benchmark the method. RESULTS: HomeoSplitter is a fast and effective solution to disentangle homeologous sequences based on a maximum likelihood optimization. On a benchmark set of 2,505 clusters containing homologous sequences of urartu, speltoides and durum, HomeoSplitter was efficient to build sequences closer to the diploid references and increased the number of valuable SNPs from 188 out of 1,360 SNPs detected when mapping the reads on the de novo durum assembly to 762 out of 1,620 SNPs when mapping on HomeoSplitter contigs. CONCLUSIONS: The HomeoSplitter program is freely available at http://bioweb.supagro.inra.fr/homeoSplitter/. This work provides a practical solution to the complex problem of disentangling homeologous transcripts in allo-tetraploids, which further allows an improved SNP detection. BioMed Central 2013-10-15 /pmc/articles/PMC3851826/ /pubmed/24564644 http://dx.doi.org/10.1186/1471-2105-14-S15-S15 Text en Copyright © 2013 Ranwez et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Ranwez, Vincent Holtz, Yan Sarah, Gautier Ardisson, Morgane Santoni, Sylvain Glémin, Sylvain Tavaud-Pirra, Muriel David, Jacques Disentangling homeologous contigs in allo-tetraploid assembly: application to durum wheat |
title | Disentangling homeologous contigs in allo-tetraploid assembly: application to durum wheat |
title_full | Disentangling homeologous contigs in allo-tetraploid assembly: application to durum wheat |
title_fullStr | Disentangling homeologous contigs in allo-tetraploid assembly: application to durum wheat |
title_full_unstemmed | Disentangling homeologous contigs in allo-tetraploid assembly: application to durum wheat |
title_short | Disentangling homeologous contigs in allo-tetraploid assembly: application to durum wheat |
title_sort | disentangling homeologous contigs in allo-tetraploid assembly: application to durum wheat |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3851826/ https://www.ncbi.nlm.nih.gov/pubmed/24564644 http://dx.doi.org/10.1186/1471-2105-14-S15-S15 |
work_keys_str_mv | AT ranwezvincent disentanglinghomeologouscontigsinallotetraploidassemblyapplicationtodurumwheat AT holtzyan disentanglinghomeologouscontigsinallotetraploidassemblyapplicationtodurumwheat AT sarahgautier disentanglinghomeologouscontigsinallotetraploidassemblyapplicationtodurumwheat AT ardissonmorgane disentanglinghomeologouscontigsinallotetraploidassemblyapplicationtodurumwheat AT santonisylvain disentanglinghomeologouscontigsinallotetraploidassemblyapplicationtodurumwheat AT gleminsylvain disentanglinghomeologouscontigsinallotetraploidassemblyapplicationtodurumwheat AT tavaudpirramuriel disentanglinghomeologouscontigsinallotetraploidassemblyapplicationtodurumwheat AT davidjacques disentanglinghomeologouscontigsinallotetraploidassemblyapplicationtodurumwheat |