Cargando…

ORFLine: a bioinformatic pipeline to prioritize small open reading frames identifies candidate secreted small proteins from lymphocytes

MOTIVATION: The annotation of small open reading frames (smORFs) of <100 codons (<300 nucleotides) is challenging due to the large number of such sequences in the genome. RESULTS: In this study, we developed a computational pipeline, which we have named ORFLine, that stringently identifies smO...

Descripción completa

Detalles Bibliográficos
Autores principales: Hu, Fengyuan, Lu, Jia, Matheson, Louise S, Díaz-Muñoz, Manuel D, Saveliev, Alexander, Xu, Jinbo, Turner, Martin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8504629/
https://www.ncbi.nlm.nih.gov/pubmed/33970232
http://dx.doi.org/10.1093/bioinformatics/btab339
_version_ 1784581358608187392
author Hu, Fengyuan
Lu, Jia
Matheson, Louise S
Díaz-Muñoz, Manuel D
Saveliev, Alexander
Xu, Jinbo
Turner, Martin
author_facet Hu, Fengyuan
Lu, Jia
Matheson, Louise S
Díaz-Muñoz, Manuel D
Saveliev, Alexander
Xu, Jinbo
Turner, Martin
author_sort Hu, Fengyuan
collection PubMed
description MOTIVATION: The annotation of small open reading frames (smORFs) of <100 codons (<300 nucleotides) is challenging due to the large number of such sequences in the genome. RESULTS: In this study, we developed a computational pipeline, which we have named ORFLine, that stringently identifies smORFs and classifies them according to their position within transcripts. We identified a total of 5744 unique smORFs in datasets from mouse B and T lymphocytes and systematically characterized them using ORFLine. We further searched smORFs for the presence of a signal peptide, which predicted known secreted chemokines as well as novel micropeptides. Four novel micropeptides show evidence of secretion and are therefore candidate mediators of immunoregulatory functions. AVAILABILITY AND IMPLEMENTATION: Freely available on the web at https://github.com/boboppie/ORFLine. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-8504629
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-85046292021-10-13 ORFLine: a bioinformatic pipeline to prioritize small open reading frames identifies candidate secreted small proteins from lymphocytes Hu, Fengyuan Lu, Jia Matheson, Louise S Díaz-Muñoz, Manuel D Saveliev, Alexander Xu, Jinbo Turner, Martin Bioinformatics Original Papers MOTIVATION: The annotation of small open reading frames (smORFs) of <100 codons (<300 nucleotides) is challenging due to the large number of such sequences in the genome. RESULTS: In this study, we developed a computational pipeline, which we have named ORFLine, that stringently identifies smORFs and classifies them according to their position within transcripts. We identified a total of 5744 unique smORFs in datasets from mouse B and T lymphocytes and systematically characterized them using ORFLine. We further searched smORFs for the presence of a signal peptide, which predicted known secreted chemokines as well as novel micropeptides. Four novel micropeptides show evidence of secretion and are therefore candidate mediators of immunoregulatory functions. AVAILABILITY AND IMPLEMENTATION: Freely available on the web at https://github.com/boboppie/ORFLine. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2021-05-10 /pmc/articles/PMC8504629/ /pubmed/33970232 http://dx.doi.org/10.1093/bioinformatics/btab339 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Hu, Fengyuan
Lu, Jia
Matheson, Louise S
Díaz-Muñoz, Manuel D
Saveliev, Alexander
Xu, Jinbo
Turner, Martin
ORFLine: a bioinformatic pipeline to prioritize small open reading frames identifies candidate secreted small proteins from lymphocytes
title ORFLine: a bioinformatic pipeline to prioritize small open reading frames identifies candidate secreted small proteins from lymphocytes
title_full ORFLine: a bioinformatic pipeline to prioritize small open reading frames identifies candidate secreted small proteins from lymphocytes
title_fullStr ORFLine: a bioinformatic pipeline to prioritize small open reading frames identifies candidate secreted small proteins from lymphocytes
title_full_unstemmed ORFLine: a bioinformatic pipeline to prioritize small open reading frames identifies candidate secreted small proteins from lymphocytes
title_short ORFLine: a bioinformatic pipeline to prioritize small open reading frames identifies candidate secreted small proteins from lymphocytes
title_sort orfline: a bioinformatic pipeline to prioritize small open reading frames identifies candidate secreted small proteins from lymphocytes
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8504629/
https://www.ncbi.nlm.nih.gov/pubmed/33970232
http://dx.doi.org/10.1093/bioinformatics/btab339
work_keys_str_mv AT hufengyuan orflineabioinformaticpipelinetoprioritizesmallopenreadingframesidentifiescandidatesecretedsmallproteinsfromlymphocytes
AT lujia orflineabioinformaticpipelinetoprioritizesmallopenreadingframesidentifiescandidatesecretedsmallproteinsfromlymphocytes
AT mathesonlouises orflineabioinformaticpipelinetoprioritizesmallopenreadingframesidentifiescandidatesecretedsmallproteinsfromlymphocytes
AT diazmunozmanueld orflineabioinformaticpipelinetoprioritizesmallopenreadingframesidentifiescandidatesecretedsmallproteinsfromlymphocytes
AT savelievalexander orflineabioinformaticpipelinetoprioritizesmallopenreadingframesidentifiescandidatesecretedsmallproteinsfromlymphocytes
AT xujinbo orflineabioinformaticpipelinetoprioritizesmallopenreadingframesidentifiescandidatesecretedsmallproteinsfromlymphocytes
AT turnermartin orflineabioinformaticpipelinetoprioritizesmallopenreadingframesidentifiescandidatesecretedsmallproteinsfromlymphocytes