Cargando…

Review on the Computational Genome Annotation of Sequences Obtained by Next-Generation Sequencing

SIMPLE SUMMARY: Due to the development of high-throughput sequencing technologies, computational genome annotation of sequences has become one of the principal research area in computational biology. First, we reviewed comparative annotation tools and pipelines for both annotations of structures and...

Descripción completa

Detalles Bibliográficos
Autores principales: Ejigu, Girum Fitihamlak, Jung, Jaehee
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7565776/
https://www.ncbi.nlm.nih.gov/pubmed/32962098
http://dx.doi.org/10.3390/biology9090295
_version_ 1783596006834700288
author Ejigu, Girum Fitihamlak
Jung, Jaehee
author_facet Ejigu, Girum Fitihamlak
Jung, Jaehee
author_sort Ejigu, Girum Fitihamlak
collection PubMed
description SIMPLE SUMMARY: Due to the development of high-throughput sequencing technologies, computational genome annotation of sequences has become one of the principal research area in computational biology. First, we reviewed comparative annotation tools and pipelines for both annotations of structures and functions, which enable us to comprehend gene functions and their genome evolution. Second, we compared genome annotation tools that utilize homology-based and ab initio methods depending on the similarity of sequences or the lack of evidences. Third, we explored visualization tools that aid the annotation process and stressed the need for the quality control of annotations and re-annotations, because misannotations may happen due to experimental errors or missed genes by preceding technologies. Finally, we highlighted how emerging technologies can be used in future annotations. ABSTRACT: Next-Generation Sequencing (NGS) has made it easier to obtain genome-wide sequence data and it has shifted the research focus into genome annotation. The challenging tasks involved in annotation rely on the currently available tools and techniques to decode the information contained in nucleotide sequences. This information will improve our understanding of general aspects of life and evolution and improve our ability to diagnose genetic disorders. Here, we present a summary of both structural and functional annotations, as well as the associated comparative annotation tools and pipelines. We highlight visualization tools that immensely aid the annotation process and the contributions of the scientific community to the annotation. Further, we discuss quality-control practices and the need for re-annotation, and highlight the future of annotation.
format Online
Article
Text
id pubmed-7565776
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75657762020-10-26 Review on the Computational Genome Annotation of Sequences Obtained by Next-Generation Sequencing Ejigu, Girum Fitihamlak Jung, Jaehee Biology (Basel) Review SIMPLE SUMMARY: Due to the development of high-throughput sequencing technologies, computational genome annotation of sequences has become one of the principal research area in computational biology. First, we reviewed comparative annotation tools and pipelines for both annotations of structures and functions, which enable us to comprehend gene functions and their genome evolution. Second, we compared genome annotation tools that utilize homology-based and ab initio methods depending on the similarity of sequences or the lack of evidences. Third, we explored visualization tools that aid the annotation process and stressed the need for the quality control of annotations and re-annotations, because misannotations may happen due to experimental errors or missed genes by preceding technologies. Finally, we highlighted how emerging technologies can be used in future annotations. ABSTRACT: Next-Generation Sequencing (NGS) has made it easier to obtain genome-wide sequence data and it has shifted the research focus into genome annotation. The challenging tasks involved in annotation rely on the currently available tools and techniques to decode the information contained in nucleotide sequences. This information will improve our understanding of general aspects of life and evolution and improve our ability to diagnose genetic disorders. Here, we present a summary of both structural and functional annotations, as well as the associated comparative annotation tools and pipelines. We highlight visualization tools that immensely aid the annotation process and the contributions of the scientific community to the annotation. Further, we discuss quality-control practices and the need for re-annotation, and highlight the future of annotation. MDPI 2020-09-18 /pmc/articles/PMC7565776/ /pubmed/32962098 http://dx.doi.org/10.3390/biology9090295 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Review
Ejigu, Girum Fitihamlak
Jung, Jaehee
Review on the Computational Genome Annotation of Sequences Obtained by Next-Generation Sequencing
title Review on the Computational Genome Annotation of Sequences Obtained by Next-Generation Sequencing
title_full Review on the Computational Genome Annotation of Sequences Obtained by Next-Generation Sequencing
title_fullStr Review on the Computational Genome Annotation of Sequences Obtained by Next-Generation Sequencing
title_full_unstemmed Review on the Computational Genome Annotation of Sequences Obtained by Next-Generation Sequencing
title_short Review on the Computational Genome Annotation of Sequences Obtained by Next-Generation Sequencing
title_sort review on the computational genome annotation of sequences obtained by next-generation sequencing
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7565776/
https://www.ncbi.nlm.nih.gov/pubmed/32962098
http://dx.doi.org/10.3390/biology9090295
work_keys_str_mv AT ejigugirumfitihamlak reviewonthecomputationalgenomeannotationofsequencesobtainedbynextgenerationsequencing
AT jungjaehee reviewonthecomputationalgenomeannotationofsequencesobtainedbynextgenerationsequencing