
Genome annotation: From human genetics to biodiversity genomics

Within the next decade, the genomes of 1.8 million eukaryotic species will be sequenced. Identifying genes in these sequences is essential to understand the biology of the species. This is challenging due to the transcriptional complexity of eukaryotic genomes, which encode hundreds of thousands of...

Bibliographic Details
Main author: Guigó, Roderic
Format: Online Article Text
Language: English
Published: Elsevier 2023
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10435374/
https://www.ncbi.nlm.nih.gov/pubmed/37601977
http://dx.doi.org/10.1016/j.xgen.2023.100375
author Guigó, Roderic
collection PubMed
description Within the next decade, the genomes of 1.8 million eukaryotic species will be sequenced. Identifying genes in these sequences is essential to understand the biology of the species. This is challenging due to the transcriptional complexity of eukaryotic genomes, which encode hundreds of thousands of transcripts of multiple types. Among these, a small set of protein-coding mRNAs play a disproportionately large role in defining phenotypes. Due to their sequence conservation, orthology can be established, making it possible to define the universal catalog of eukaryotic protein-coding genes. This catalog should substantially contribute to uncovering the genomic events underlying the emergence of eukaryotic phenotypes. This piece briefly reviews the basics of protein-coding gene prediction, discusses challenges in finalizing annotation of the human genome, and proposes strategies for producing annotations across the eukaryotic Tree of Life. This lays the groundwork for obtaining the catalog of all genes—the Earth’s code of life.
format Online
Article
Text
id pubmed-10435374
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-10435374 2023-08-19 Genome annotation: From human genetics to biodiversity genomics Guigó, Roderic Cell Genom Perspective Elsevier 2023-08-01 /pmc/articles/PMC10435374/ /pubmed/37601977 http://dx.doi.org/10.1016/j.xgen.2023.100375 Text en © 2023 The Author https://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
title Genome annotation: From human genetics to biodiversity genomics
topic Perspective