Cargando…

Recognition of the polycistronic nature of human genes is critical to understanding the genotype-phenotype relationship

Technological advances promise unprecedented opportunities for whole exome sequencing and proteomic analyses of populations. Currently, data from genome and exome sequencing or proteomic studies are searched against reference genome annotations. This provides the foundation for research and clinical...

Descripción completa

Detalles Bibliográficos
Autores principales: Brunet, Marie A., Levesque, Sébastien A., Hunting, Darel J., Cohen, Alan A., Roucou, Xavier
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5932603/
https://www.ncbi.nlm.nih.gov/pubmed/29626081
http://dx.doi.org/10.1101/gr.230938.117
_version_ 1783319846766772224
author Brunet, Marie A.
Levesque, Sébastien A.
Hunting, Darel J.
Cohen, Alan A.
Roucou, Xavier
author_facet Brunet, Marie A.
Levesque, Sébastien A.
Hunting, Darel J.
Cohen, Alan A.
Roucou, Xavier
author_sort Brunet, Marie A.
collection PubMed
description Technological advances promise unprecedented opportunities for whole exome sequencing and proteomic analyses of populations. Currently, data from genome and exome sequencing or proteomic studies are searched against reference genome annotations. This provides the foundation for research and clinical screening for genetic causes of pathologies. However, current genome annotations substantially underestimate the proteomic information encoded within a gene. Numerous studies have now demonstrated the expression and function of alternative (mainly small, sometimes overlapping) ORFs within mature gene transcripts. This has important consequences for the correlation of phenotypes and genotypes. Most alternative ORFs are not yet annotated because of a lack of evidence, and this absence from databases precludes their detection by standard proteomic methods, such as mass spectrometry. Here, we demonstrate how current approaches tend to overlook alternative ORFs, hindering the discovery of new genetic drivers and fundamental research. We discuss available tools and techniques to improve identification of proteins from alternative ORFs and finally suggest a novel annotation system to permit a more complete representation of the transcriptomic and proteomic information contained within a gene. Given the crucial challenge of distinguishing functional ORFs from random ones, the suggested pipeline emphasizes both experimental data and conservation signatures. The addition of alternative ORFs in databases will render identification less serendipitous and advance the pace of research and genomic knowledge. This review highlights the urgent medical and research need to incorporate alternative ORFs in current genome annotations and thus permit their inclusion in hypotheses and models, which relate phenotypes and genotypes.
format Online
Article
Text
id pubmed-5932603
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-59326032018-05-31 Recognition of the polycistronic nature of human genes is critical to understanding the genotype-phenotype relationship Brunet, Marie A. Levesque, Sébastien A. Hunting, Darel J. Cohen, Alan A. Roucou, Xavier Genome Res Perspective Technological advances promise unprecedented opportunities for whole exome sequencing and proteomic analyses of populations. Currently, data from genome and exome sequencing or proteomic studies are searched against reference genome annotations. This provides the foundation for research and clinical screening for genetic causes of pathologies. However, current genome annotations substantially underestimate the proteomic information encoded within a gene. Numerous studies have now demonstrated the expression and function of alternative (mainly small, sometimes overlapping) ORFs within mature gene transcripts. This has important consequences for the correlation of phenotypes and genotypes. Most alternative ORFs are not yet annotated because of a lack of evidence, and this absence from databases precludes their detection by standard proteomic methods, such as mass spectrometry. Here, we demonstrate how current approaches tend to overlook alternative ORFs, hindering the discovery of new genetic drivers and fundamental research. We discuss available tools and techniques to improve identification of proteins from alternative ORFs and finally suggest a novel annotation system to permit a more complete representation of the transcriptomic and proteomic information contained within a gene. Given the crucial challenge of distinguishing functional ORFs from random ones, the suggested pipeline emphasizes both experimental data and conservation signatures. The addition of alternative ORFs in databases will render identification less serendipitous and advance the pace of research and genomic knowledge. This review highlights the urgent medical and research need to incorporate alternative ORFs in current genome annotations and thus permit their inclusion in hypotheses and models, which relate phenotypes and genotypes. Cold Spring Harbor Laboratory Press 2018-05 /pmc/articles/PMC5932603/ /pubmed/29626081 http://dx.doi.org/10.1101/gr.230938.117 Text en © 2018 Brunet et al.; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by/4.0/ This article, published in Genome Research, is available under a Creative Commons License (Attribution 4.0 International), as described at http://creativecommons.org/licenses/by/4.0/.
spellingShingle Perspective
Brunet, Marie A.
Levesque, Sébastien A.
Hunting, Darel J.
Cohen, Alan A.
Roucou, Xavier
Recognition of the polycistronic nature of human genes is critical to understanding the genotype-phenotype relationship
title Recognition of the polycistronic nature of human genes is critical to understanding the genotype-phenotype relationship
title_full Recognition of the polycistronic nature of human genes is critical to understanding the genotype-phenotype relationship
title_fullStr Recognition of the polycistronic nature of human genes is critical to understanding the genotype-phenotype relationship
title_full_unstemmed Recognition of the polycistronic nature of human genes is critical to understanding the genotype-phenotype relationship
title_short Recognition of the polycistronic nature of human genes is critical to understanding the genotype-phenotype relationship
title_sort recognition of the polycistronic nature of human genes is critical to understanding the genotype-phenotype relationship
topic Perspective
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5932603/
https://www.ncbi.nlm.nih.gov/pubmed/29626081
http://dx.doi.org/10.1101/gr.230938.117
work_keys_str_mv AT brunetmariea recognitionofthepolycistronicnatureofhumangenesiscriticaltounderstandingthegenotypephenotyperelationship
AT levesquesebastiena recognitionofthepolycistronicnatureofhumangenesiscriticaltounderstandingthegenotypephenotyperelationship
AT huntingdarelj recognitionofthepolycistronicnatureofhumangenesiscriticaltounderstandingthegenotypephenotyperelationship
AT cohenalana recognitionofthepolycistronicnatureofhumangenesiscriticaltounderstandingthegenotypephenotyperelationship
AT roucouxavier recognitionofthepolycistronicnatureofhumangenesiscriticaltounderstandingthegenotypephenotyperelationship