Cargando…

FGMP: assessing fungal genome completeness

BACKGROUND: Inexpensive high-throughput DNA sequencing has democratized access to genetic information for most organisms so that research utilizing a genome or transcriptome of an organism is not limited to model systems. However, the quality of the assemblies of sampled genomes can vary greatly whi...

Descripción completa

Detalles Bibliográficos
Autores principales: Cissé, Ousmane H., Stajich, Jason E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6466665/
https://www.ncbi.nlm.nih.gov/pubmed/30987585
http://dx.doi.org/10.1186/s12859-019-2782-9
_version_ 1783411151769436160
author Cissé, Ousmane H.
Stajich, Jason E.
author_facet Cissé, Ousmane H.
Stajich, Jason E.
author_sort Cissé, Ousmane H.
collection PubMed
description BACKGROUND: Inexpensive high-throughput DNA sequencing has democratized access to genetic information for most organisms so that research utilizing a genome or transcriptome of an organism is not limited to model systems. However, the quality of the assemblies of sampled genomes can vary greatly which hampers utility for comparisons and meaningful interpretation. The uncertainty of the completeness of a given genome sequence can limit feasibility of asserting patterns of high rates of gene loss reported in many lineages. RESULTS: We propose a computational framework and sequence resource for assessing completeness of fungal genomes called FGMP (Fungal Genome Mapping Project). Our approach is based on evolutionary conserved sets of proteins and DNA elements and is applicable to various types of genomic data. We present a comparison of FGMP and state-of-the-art methods for genome completeness assessment utilizing 246 genome assemblies of fungi. We discuss genome assembly improvements/degradations in 57 cases where assemblies have been updated, as recorded by NCBI assembly archive. CONCLUSION: FGMP is an accurate tool for quantifying level of completion from fungal genomic data. It is particularly useful for non-model organisms without reference genomes and can be used directly on unassembled reads, which can help reducing genome sequencing costs. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-019-2782-9) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6466665
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-64666652019-04-22 FGMP: assessing fungal genome completeness Cissé, Ousmane H. Stajich, Jason E. BMC Bioinformatics Software BACKGROUND: Inexpensive high-throughput DNA sequencing has democratized access to genetic information for most organisms so that research utilizing a genome or transcriptome of an organism is not limited to model systems. However, the quality of the assemblies of sampled genomes can vary greatly which hampers utility for comparisons and meaningful interpretation. The uncertainty of the completeness of a given genome sequence can limit feasibility of asserting patterns of high rates of gene loss reported in many lineages. RESULTS: We propose a computational framework and sequence resource for assessing completeness of fungal genomes called FGMP (Fungal Genome Mapping Project). Our approach is based on evolutionary conserved sets of proteins and DNA elements and is applicable to various types of genomic data. We present a comparison of FGMP and state-of-the-art methods for genome completeness assessment utilizing 246 genome assemblies of fungi. We discuss genome assembly improvements/degradations in 57 cases where assemblies have been updated, as recorded by NCBI assembly archive. CONCLUSION: FGMP is an accurate tool for quantifying level of completion from fungal genomic data. It is particularly useful for non-model organisms without reference genomes and can be used directly on unassembled reads, which can help reducing genome sequencing costs. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-019-2782-9) contains supplementary material, which is available to authorized users. BioMed Central 2019-04-15 /pmc/articles/PMC6466665/ /pubmed/30987585 http://dx.doi.org/10.1186/s12859-019-2782-9 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Cissé, Ousmane H.
Stajich, Jason E.
FGMP: assessing fungal genome completeness
title FGMP: assessing fungal genome completeness
title_full FGMP: assessing fungal genome completeness
title_fullStr FGMP: assessing fungal genome completeness
title_full_unstemmed FGMP: assessing fungal genome completeness
title_short FGMP: assessing fungal genome completeness
title_sort fgmp: assessing fungal genome completeness
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6466665/
https://www.ncbi.nlm.nih.gov/pubmed/30987585
http://dx.doi.org/10.1186/s12859-019-2782-9
work_keys_str_mv AT cisseousmaneh fgmpassessingfungalgenomecompleteness
AT stajichjasone fgmpassessingfungalgenomecompleteness