Cargando…

Short open reading frames (sORFs) and microproteins: an update on their identification and validation measures

A short open reading frame (sORFs) constitutes ≤ 300 bases, encoding a microprotein or sORF-encoded protein (SEP) which comprises ≤ 100 amino acids. Traditionally dismissed by genome annotation pipelines as meaningless noise, sORFs were found to possess coding potential with ribosome profiling (RIBO...

Descripción completa

Detalles Bibliográficos
Autores principales: Leong, Alyssa Zi-Xin, Lee, Pey Yee, Mohtar, M. Aiman, Syafruddin, Saiful Effendi, Pung, Yuh-Fen, Low, Teck Yew
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8928697/
https://www.ncbi.nlm.nih.gov/pubmed/35300685
http://dx.doi.org/10.1186/s12929-022-00802-5
_version_ 1784670695466205184
author Leong, Alyssa Zi-Xin
Lee, Pey Yee
Mohtar, M. Aiman
Syafruddin, Saiful Effendi
Pung, Yuh-Fen
Low, Teck Yew
author_facet Leong, Alyssa Zi-Xin
Lee, Pey Yee
Mohtar, M. Aiman
Syafruddin, Saiful Effendi
Pung, Yuh-Fen
Low, Teck Yew
author_sort Leong, Alyssa Zi-Xin
collection PubMed
description A short open reading frame (sORFs) constitutes ≤ 300 bases, encoding a microprotein or sORF-encoded protein (SEP) which comprises ≤ 100 amino acids. Traditionally dismissed by genome annotation pipelines as meaningless noise, sORFs were found to possess coding potential with ribosome profiling (RIBO-Seq), which unveiled sORF-based transcripts at various genome locations. Nonetheless, the existence of corresponding microproteins that are stable and functional was little substantiated by experimental evidence initially. With recent advancements in multi-omics, the identification, validation, and functional characterisation of sORFs and microproteins have become feasible. In this review, we discuss the history and development of an emerging research field of sORFs and microproteins. In particular, we focus on an array of bioinformatics and OMICS approaches used for predicting, sequencing, validating, and characterizing these recently discovered entities. These strategies include RIBO-Seq which detects sORF transcripts via ribosome footprints, and mass spectrometry (MS)-based proteomics for sequencing the resultant microproteins. Subsequently, our discussion extends to the functional characterisation of microproteins by incorporating CRISPR/Cas9 screen and protein–protein interaction (PPI) studies. Our review discusses not only detection methodologies, but we also highlight on the challenges and potential solutions in identifying and validating sORFs and their microproteins. The novelty of this review lies within its validation for the functional role of microproteins, which could contribute towards the future landscape of microproteomics.
format Online
Article
Text
id pubmed-8928697
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-89286972022-03-23 Short open reading frames (sORFs) and microproteins: an update on their identification and validation measures Leong, Alyssa Zi-Xin Lee, Pey Yee Mohtar, M. Aiman Syafruddin, Saiful Effendi Pung, Yuh-Fen Low, Teck Yew J Biomed Sci Review A short open reading frame (sORFs) constitutes ≤ 300 bases, encoding a microprotein or sORF-encoded protein (SEP) which comprises ≤ 100 amino acids. Traditionally dismissed by genome annotation pipelines as meaningless noise, sORFs were found to possess coding potential with ribosome profiling (RIBO-Seq), which unveiled sORF-based transcripts at various genome locations. Nonetheless, the existence of corresponding microproteins that are stable and functional was little substantiated by experimental evidence initially. With recent advancements in multi-omics, the identification, validation, and functional characterisation of sORFs and microproteins have become feasible. In this review, we discuss the history and development of an emerging research field of sORFs and microproteins. In particular, we focus on an array of bioinformatics and OMICS approaches used for predicting, sequencing, validating, and characterizing these recently discovered entities. These strategies include RIBO-Seq which detects sORF transcripts via ribosome footprints, and mass spectrometry (MS)-based proteomics for sequencing the resultant microproteins. Subsequently, our discussion extends to the functional characterisation of microproteins by incorporating CRISPR/Cas9 screen and protein–protein interaction (PPI) studies. Our review discusses not only detection methodologies, but we also highlight on the challenges and potential solutions in identifying and validating sORFs and their microproteins. The novelty of this review lies within its validation for the functional role of microproteins, which could contribute towards the future landscape of microproteomics. BioMed Central 2022-03-17 /pmc/articles/PMC8928697/ /pubmed/35300685 http://dx.doi.org/10.1186/s12929-022-00802-5 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Review
Leong, Alyssa Zi-Xin
Lee, Pey Yee
Mohtar, M. Aiman
Syafruddin, Saiful Effendi
Pung, Yuh-Fen
Low, Teck Yew
Short open reading frames (sORFs) and microproteins: an update on their identification and validation measures
title Short open reading frames (sORFs) and microproteins: an update on their identification and validation measures
title_full Short open reading frames (sORFs) and microproteins: an update on their identification and validation measures
title_fullStr Short open reading frames (sORFs) and microproteins: an update on their identification and validation measures
title_full_unstemmed Short open reading frames (sORFs) and microproteins: an update on their identification and validation measures
title_short Short open reading frames (sORFs) and microproteins: an update on their identification and validation measures
title_sort short open reading frames (sorfs) and microproteins: an update on their identification and validation measures
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8928697/
https://www.ncbi.nlm.nih.gov/pubmed/35300685
http://dx.doi.org/10.1186/s12929-022-00802-5
work_keys_str_mv AT leongalyssazixin shortopenreadingframessorfsandmicroproteinsanupdateontheiridentificationandvalidationmeasures
AT leepeyyee shortopenreadingframessorfsandmicroproteinsanupdateontheiridentificationandvalidationmeasures
AT mohtarmaiman shortopenreadingframessorfsandmicroproteinsanupdateontheiridentificationandvalidationmeasures
AT syafruddinsaifuleffendi shortopenreadingframessorfsandmicroproteinsanupdateontheiridentificationandvalidationmeasures
AT pungyuhfen shortopenreadingframessorfsandmicroproteinsanupdateontheiridentificationandvalidationmeasures
AT lowteckyew shortopenreadingframessorfsandmicroproteinsanupdateontheiridentificationandvalidationmeasures