
ImproveAssembly - Tool for identifying new gene products and improving genome assembly

The availability of biological information in public databases has increased exponentially. To ensure the accuracy of this information, researchers have adopted several methods and refinements to avoid the dissemination of incorrect information; for example, several automated tools are available for...


Bibliographic Details
Main authors: Veras, Adonney Allan de Oliveira, Merlin, Bruno, de Sá, Pablo Henrique Caracciolo Gomes
Format: Online Article Text
Language: English
Published: Public Library of Science 2018
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6203371/
https://www.ncbi.nlm.nih.gov/pubmed/30365512
http://dx.doi.org/10.1371/journal.pone.0206000
_version_ 1783365862574522368
author Veras, Adonney Allan de Oliveira
Merlin, Bruno
de Sá, Pablo Henrique Caracciolo Gomes
author_facet Veras, Adonney Allan de Oliveira
Merlin, Bruno
de Sá, Pablo Henrique Caracciolo Gomes
author_sort Veras, Adonney Allan de Oliveira
collection PubMed
description The availability of biological information in public databases has increased exponentially. To ensure the accuracy of this information, researchers have adopted several methods and refinements to avoid the dissemination of incorrect information; for example, several automated tools are available for annotation processes. However, manual curation ensures and enriches biological information. Additionally, the genomic finishing process is complex, resulting in increased deposition of draft genomes. This introduces bias in other omics analyses because incomplete genomic content is used. This is also observed for complete genomes. For example, genomes generated by reference assembly may not include new products present in the new sequence, and errors or bias can occur during the assembly process. Thus, we developed ImproveAssembly, a tool capable of identifying new products missing from genomic sequences, which can be used for complete and draft genomes. The identified products can improve the annotation of complete genomes and drafts while significantly reducing the bias when the information is used in other omics analyses.
format Online
Article
Text
id pubmed-6203371
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-6203371 2018-11-19 ImproveAssembly - Tool for identifying new gene products and improving genome assembly Veras, Adonney Allan de Oliveira Merlin, Bruno de Sá, Pablo Henrique Caracciolo Gomes PLoS One Research Article The availability of biological information in public databases has increased exponentially. To ensure the accuracy of this information, researchers have adopted several methods and refinements to avoid the dissemination of incorrect information; for example, several automated tools are available for annotation processes. However, manual curation ensures and enriches biological information. Additionally, the genomic finishing process is complex, resulting in increased deposition of draft genomes. This introduces bias in other omics analyses because incomplete genomic content is used. This is also observed for complete genomes. For example, genomes generated by reference assembly may not include new products present in the new sequence, and errors or bias can occur during the assembly process. Thus, we developed ImproveAssembly, a tool capable of identifying new products missing from genomic sequences, which can be used for complete and draft genomes. The identified products can improve the annotation of complete genomes and drafts while significantly reducing the bias when the information is used in other omics analyses. Public Library of Science 2018-10-26 /pmc/articles/PMC6203371/ /pubmed/30365512 http://dx.doi.org/10.1371/journal.pone.0206000 Text en © 2018 Veras et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Veras, Adonney Allan de Oliveira
Merlin, Bruno
de Sá, Pablo Henrique Caracciolo Gomes
ImproveAssembly - Tool for identifying new gene products and improving genome assembly
title ImproveAssembly - Tool for identifying new gene products and improving genome assembly
title_sort improveassembly - tool for identifying new gene products and improving genome assembly
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6203371/
https://www.ncbi.nlm.nih.gov/pubmed/30365512
http://dx.doi.org/10.1371/journal.pone.0206000