Cargando…

Platanus-allee is a de novo haplotype assembler enabling a comprehensive access to divergent heterozygous regions

The ultimate goal for diploid genome determination is to completely decode homologous chromosomes independently, and several phasing programs from consensus sequences have been developed. These methods work well for lowly heterozygous genomes, but the manifold species have high heterozygosity. Addit...

Descripción completa

Detalles Bibliográficos
Autores principales: Kajitani, Rei, Yoshimura, Dai, Okuno, Miki, Minakuchi, Yohei, Kagoshima, Hiroshi, Fujiyama, Asao, Kubokawa, Kaoru, Kohara, Yuji, Toyoda, Atsushi, Itoh, Takehiko
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6461651/
https://www.ncbi.nlm.nih.gov/pubmed/30979905
http://dx.doi.org/10.1038/s41467-019-09575-2
Descripción
Sumario:The ultimate goal for diploid genome determination is to completely decode homologous chromosomes independently, and several phasing programs from consensus sequences have been developed. These methods work well for lowly heterozygous genomes, but the manifold species have high heterozygosity. Additionally, there are highly divergent regions (HDRs), where the haplotype sequences differ considerably. Because HDRs are likely to direct various interesting biological phenomena, many genomic analysis targets fall within these regions. However, they cannot be accessed by existing phasing methods, and we have to adopt costly traditional methods. Here, we develop a de novo haplotype assembler, Platanus-allee (http://platanus.bio.titech.ac.jp/platanus2), which initially constructs each haplotype sequence and then untangles the assembly graphs utilizing sequence links and synteny information. A comprehensive benchmark analysis reveals that Platanus-allee exhibits high recall and precision, particularly for HDRs. Using this approach, previously unknown HDRs are detected in the human genome, which may uncover novel aspects of genome variability.