Cargando…

gcaPDA: a haplotype-resolved diploid assembler

BACKGROUND: Generating chromosome-scale haplotype resolved assembly is important for functional studies. However, current de novo assemblers are either haploid assemblers that discard allelic information, or diploid assemblers that can only tackle genomes of low complexity. RESULTS: Here, Using robu...

Descripción completa

Detalles Bibliográficos
Autores principales: Xie, Min, Yang, Linfeng, Jiang, Chenglin, Wu, Shenshen, Luo, Cheng, Yang, Xin, He, Lijuan, Chen, Shixuan, Deng, Tianquan, Ye, Mingzhi, Yan, Jianbing, Yang, Ning
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8842951/
https://www.ncbi.nlm.nih.gov/pubmed/35164674
http://dx.doi.org/10.1186/s12859-022-04591-4
Descripción
Sumario:BACKGROUND: Generating chromosome-scale haplotype resolved assembly is important for functional studies. However, current de novo assemblers are either haploid assemblers that discard allelic information, or diploid assemblers that can only tackle genomes of low complexity. RESULTS: Here, Using robust programs, we build a diploid genome assembly pipeline called gcaPDA (gamete cells assisted Phased Diploid Assembler), which exploits haploid gamete cells to assist in resolving haplotypes. We demonstrate the effectiveness of gcaPDA based on simulated HiFi reads of maize genome which is highly heterozygous and repetitive, and real data from rice. CONCLUSIONS: With applicability of coping with complex genomes and fewer restrictions on application than most of diploid assemblers, gcaPDA is likely to find broad applications in studies of eukaryotic genomes. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-04591-4.