
Assessing genome assembly quality using the LTR Assembly Index (LAI)

Assembling a plant genome is challenging due to the abundance of repetitive sequences, yet no standard is available to evaluate the assembly of repeat space. LTR retrotransposons (LTR-RTs) are the predominant interspersed repeat that is poorly assembled in draft genomes. Here, we propose a reference...


Bibliographic Details
Main Authors: Ou, Shujun; Chen, Jinfeng; Jiang, Ning
Format: Online Article Text
Language: English
Published: Oxford University Press 2018
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6265445/
https://www.ncbi.nlm.nih.gov/pubmed/30107434
http://dx.doi.org/10.1093/nar/gky730
author Ou, Shujun
Chen, Jinfeng
Jiang, Ning
collection PubMed
description Assembling a plant genome is challenging due to the abundance of repetitive sequences, yet no standard is available to evaluate the assembly of repeat space. LTR retrotransposons (LTR-RTs) are the predominant interspersed repeat that is poorly assembled in draft genomes. Here, we propose a reference-free genome metric called LTR Assembly Index (LAI) that evaluates assembly continuity using LTR-RTs. After correcting for LTR-RT amplification dynamics, we show that LAI is independent of genome size, genomic LTR-RT content, and gene space evaluation metrics (i.e., BUSCO and CEGMA). By comparing genomic sequences produced by various sequencing techniques, we reveal the significant gain of assembly continuity by using long-read-based techniques over short-read-based methods. Moreover, LAI can facilitate iterative assembly improvement with assembler selection and identify low-quality genomic regions. To apply LAI, intact LTR-RTs and total LTR-RTs should contribute at least 0.1% and 5% to the genome size, respectively. The LAI program is freely available on GitHub: https://github.com/oushujun/LTR_retriever.
format Online
Article
Text
id pubmed-6265445
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-6265445 2018-12-04 Assessing genome assembly quality using the LTR Assembly Index (LAI)
Ou, Shujun; Chen, Jinfeng; Jiang, Ning
Nucleic Acids Res
Methods Online
Oxford University Press 2018-11-30 2018-08-10
/pmc/articles/PMC6265445/ /pubmed/30107434 http://dx.doi.org/10.1093/nar/gky730
Text en
© The Author(s) 2018. Published by Oxford University Press on behalf of Nucleic Acids Research.
http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
title Assessing genome assembly quality using the LTR Assembly Index (LAI)
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6265445/
https://www.ncbi.nlm.nih.gov/pubmed/30107434
http://dx.doi.org/10.1093/nar/gky730
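The description in this record states that LAI applies only when intact LTR-RTs make up at least 0.1% of the genome and total LTR-RTs at least 5%. The following is a minimal sketch of that applicability check, assuming the three base-pair totals are already known from an upstream annotation. The names `LtrSummary` and `lai_applicable` are hypothetical and are not part of the LTR_retriever package; the sketch does not compute LAI itself.

```python
from dataclasses import dataclass

# Thresholds taken from the record's description (percent of genome size).
MIN_INTACT_LTR_PCT = 0.1
MIN_TOTAL_LTR_PCT = 5.0


@dataclass
class LtrSummary:
    """Hypothetical container for per-assembly totals, in base pairs."""
    genome_bp: int
    intact_ltr_bp: int
    total_ltr_bp: int


def lai_applicable(summary: LtrSummary) -> bool:
    """Return True if both LTR-RT fractions meet the stated minimums."""
    if summary.genome_bp <= 0:
        raise ValueError("genome_bp must be positive")
    intact_pct = 100.0 * summary.intact_ltr_bp / summary.genome_bp
    total_pct = 100.0 * summary.total_ltr_bp / summary.genome_bp
    return intact_pct >= MIN_INTACT_LTR_PCT and total_pct >= MIN_TOTAL_LTR_PCT


if __name__ == "__main__":
    # Illustrative numbers only, not data from the article.
    example = LtrSummary(genome_bp=1_000_000, intact_ltr_bp=2_000, total_ltr_bp=60_000)
    print(lai_applicable(example))
```

In this sketch both conditions must hold together, so an assembly rich in LTR-RT fragments but with too few intact elements is still reported as unsuitable.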