Cargando…
Assessing genome assembly quality using the LTR Assembly Index (LAI)
Assembling a plant genome is challenging due to the abundance of repetitive sequences, yet no standard is available to evaluate the assembly of repeat space. LTR retrotransposons (LTR-RTs) are the predominant interspersed repeat that is poorly assembled in draft genomes. Here, we propose a reference...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6265445/ https://www.ncbi.nlm.nih.gov/pubmed/30107434 http://dx.doi.org/10.1093/nar/gky730 |
_version_ | 1783375637673672704 |
---|---|
author | Ou, Shujun Chen, Jinfeng Jiang, Ning |
author_facet | Ou, Shujun Chen, Jinfeng Jiang, Ning |
author_sort | Ou, Shujun |
collection | PubMed |
description | Assembling a plant genome is challenging due to the abundance of repetitive sequences, yet no standard is available to evaluate the assembly of repeat space. LTR retrotransposons (LTR-RTs) are the predominant interspersed repeat that is poorly assembled in draft genomes. Here, we propose a reference-free genome metric called LTR Assembly Index (LAI) that evaluates assembly continuity using LTR-RTs. After correcting for LTR-RT amplification dynamics, we show that LAI is independent of genome size, genomic LTR-RT content, and gene space evaluation metrics (i.e., BUSCO and CEGMA). By comparing genomic sequences produced by various sequencing techniques, we reveal the significant gain of assembly continuity by using long-read-based techniques over short-read-based methods. Moreover, LAI can facilitate iterative assembly improvement with assembler selection and identify low-quality genomic regions. To apply LAI, intact LTR-RTs and total LTR-RTs should contribute at least 0.1% and 5% to the genome size, respectively. The LAI program is freely available on GitHub: https://github.com/oushujun/LTR_retriever. |
format | Online Article Text |
id | pubmed-6265445 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-62654452018-12-04 Assessing genome assembly quality using the LTR Assembly Index (LAI) Ou, Shujun Chen, Jinfeng Jiang, Ning Nucleic Acids Res Methods Online Assembling a plant genome is challenging due to the abundance of repetitive sequences, yet no standard is available to evaluate the assembly of repeat space. LTR retrotransposons (LTR-RTs) are the predominant interspersed repeat that is poorly assembled in draft genomes. Here, we propose a reference-free genome metric called LTR Assembly Index (LAI) that evaluates assembly continuity using LTR-RTs. After correcting for LTR-RT amplification dynamics, we show that LAI is independent of genome size, genomic LTR-RT content, and gene space evaluation metrics (i.e., BUSCO and CEGMA). By comparing genomic sequences produced by various sequencing techniques, we reveal the significant gain of assembly continuity by using long-read-based techniques over short-read-based methods. Moreover, LAI can facilitate iterative assembly improvement with assembler selection and identify low-quality genomic regions. To apply LAI, intact LTR-RTs and total LTR-RTs should contribute at least 0.1% and 5% to the genome size, respectively. The LAI program is freely available on GitHub: https://github.com/oushujun/LTR_retriever. Oxford University Press 2018-11-30 2018-08-10 /pmc/articles/PMC6265445/ /pubmed/30107434 http://dx.doi.org/10.1093/nar/gky730 Text en © The Author(s) 2018. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methods Online Ou, Shujun Chen, Jinfeng Jiang, Ning Assessing genome assembly quality using the LTR Assembly Index (LAI) |
title | Assessing genome assembly quality using the LTR Assembly Index (LAI) |
title_full | Assessing genome assembly quality using the LTR Assembly Index (LAI) |
title_fullStr | Assessing genome assembly quality using the LTR Assembly Index (LAI) |
title_full_unstemmed | Assessing genome assembly quality using the LTR Assembly Index (LAI) |
title_short | Assessing genome assembly quality using the LTR Assembly Index (LAI) |
title_sort | assessing genome assembly quality using the ltr assembly index (lai) |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6265445/ https://www.ncbi.nlm.nih.gov/pubmed/30107434 http://dx.doi.org/10.1093/nar/gky730 |
work_keys_str_mv | AT oushujun assessinggenomeassemblyqualityusingtheltrassemblyindexlai AT chenjinfeng assessinggenomeassemblyqualityusingtheltrassemblyindexlai AT jiangning assessinggenomeassemblyqualityusingtheltrassemblyindexlai |