Cargando…

An optimized approach for local de novo assembly of overlapping paired-end RAD reads from multiple individuals

Restriction site-associated DNA (RAD) sequencing is revolutionizing studies in ecological, evolutionary and conservation genomics. However, the assembly of paired-end RAD reads with random-sheared ends is still challenging, especially for non-model species with high genetic variance. Here, we presen...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Yu-Long, Xue, Dong-Xiu, Zhang, Bai-Dong, Liu, Jin-Xian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society Publishing 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5830760/
https://www.ncbi.nlm.nih.gov/pubmed/29515871
http://dx.doi.org/10.1098/rsos.171589
_version_ 1783303056140533760
author Li, Yu-Long
Xue, Dong-Xiu
Zhang, Bai-Dong
Liu, Jin-Xian
author_facet Li, Yu-Long
Xue, Dong-Xiu
Zhang, Bai-Dong
Liu, Jin-Xian
author_sort Li, Yu-Long
collection PubMed
description Restriction site-associated DNA (RAD) sequencing is revolutionizing studies in ecological, evolutionary and conservation genomics. However, the assembly of paired-end RAD reads with random-sheared ends is still challenging, especially for non-model species with high genetic variance. Here, we present an efficient optimized approach with a pipeline software, RADassembler, which makes full use of paired-end RAD reads with random-sheared ends from multiple individuals to assemble RAD contigs. RADassembler integrates the algorithms for choosing the optimal number of mismatches within and across individuals at the clustering stage, and then uses a two-step assembly approach at the assembly stage. RADassembler also uses data reduction and parallelization strategies to promote efficiency. Compared to other tools, both the assembly results based on simulation and real RAD datasets demonstrated that RADassembler could always assemble the appropriate number of contigs with high qualities, and more read pairs were properly mapped to the assembled contigs. This approach provides an optimal tool for dealing with the complexity in the assembly of paired-end RAD reads with random-sheared ends for non-model species in ecological, evolutionary and conservation studies. RADassembler is available at https://github.com/lyl8086/RADscripts.
format Online
Article
Text
id pubmed-5830760
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher The Royal Society Publishing
record_format MEDLINE/PubMed
spelling pubmed-58307602018-03-07 An optimized approach for local de novo assembly of overlapping paired-end RAD reads from multiple individuals Li, Yu-Long Xue, Dong-Xiu Zhang, Bai-Dong Liu, Jin-Xian R Soc Open Sci Genetics Restriction site-associated DNA (RAD) sequencing is revolutionizing studies in ecological, evolutionary and conservation genomics. However, the assembly of paired-end RAD reads with random-sheared ends is still challenging, especially for non-model species with high genetic variance. Here, we present an efficient optimized approach with a pipeline software, RADassembler, which makes full use of paired-end RAD reads with random-sheared ends from multiple individuals to assemble RAD contigs. RADassembler integrates the algorithms for choosing the optimal number of mismatches within and across individuals at the clustering stage, and then uses a two-step assembly approach at the assembly stage. RADassembler also uses data reduction and parallelization strategies to promote efficiency. Compared to other tools, both the assembly results based on simulation and real RAD datasets demonstrated that RADassembler could always assemble the appropriate number of contigs with high qualities, and more read pairs were properly mapped to the assembled contigs. This approach provides an optimal tool for dealing with the complexity in the assembly of paired-end RAD reads with random-sheared ends for non-model species in ecological, evolutionary and conservation studies. RADassembler is available at https://github.com/lyl8086/RADscripts. The Royal Society Publishing 2018-02-28 /pmc/articles/PMC5830760/ /pubmed/29515871 http://dx.doi.org/10.1098/rsos.171589 Text en © 2018 The Authors. http://creativecommons.org/licenses/by/4.0/ Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/, which permits unrestricted use, provided the original author and source are credited.
spellingShingle Genetics
Li, Yu-Long
Xue, Dong-Xiu
Zhang, Bai-Dong
Liu, Jin-Xian
An optimized approach for local de novo assembly of overlapping paired-end RAD reads from multiple individuals
title An optimized approach for local de novo assembly of overlapping paired-end RAD reads from multiple individuals
title_full An optimized approach for local de novo assembly of overlapping paired-end RAD reads from multiple individuals
title_fullStr An optimized approach for local de novo assembly of overlapping paired-end RAD reads from multiple individuals
title_full_unstemmed An optimized approach for local de novo assembly of overlapping paired-end RAD reads from multiple individuals
title_short An optimized approach for local de novo assembly of overlapping paired-end RAD reads from multiple individuals
title_sort optimized approach for local de novo assembly of overlapping paired-end rad reads from multiple individuals
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5830760/
https://www.ncbi.nlm.nih.gov/pubmed/29515871
http://dx.doi.org/10.1098/rsos.171589
work_keys_str_mv AT liyulong anoptimizedapproachforlocaldenovoassemblyofoverlappingpairedendradreadsfrommultipleindividuals
AT xuedongxiu anoptimizedapproachforlocaldenovoassemblyofoverlappingpairedendradreadsfrommultipleindividuals
AT zhangbaidong anoptimizedapproachforlocaldenovoassemblyofoverlappingpairedendradreadsfrommultipleindividuals
AT liujinxian anoptimizedapproachforlocaldenovoassemblyofoverlappingpairedendradreadsfrommultipleindividuals
AT liyulong optimizedapproachforlocaldenovoassemblyofoverlappingpairedendradreadsfrommultipleindividuals
AT xuedongxiu optimizedapproachforlocaldenovoassemblyofoverlappingpairedendradreadsfrommultipleindividuals
AT zhangbaidong optimizedapproachforlocaldenovoassemblyofoverlappingpairedendradreadsfrommultipleindividuals
AT liujinxian optimizedapproachforlocaldenovoassemblyofoverlappingpairedendradreadsfrommultipleindividuals