Cargando…

Deeply Mining a Universe of Peptides Encoded by Long Noncoding RNAs

Many small ORFs embedded in long noncoding RNA (lncRNA) transcripts have been shown to encode biologically functional polypeptides (small ORF-encoded polypeptides [SEPs]) in different organisms. Despite some novel SEPs have been found, the identification is still hampered by their poor predictabilit...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Qing, Wu, Erzhong, Tang, Yiheng, Cai, Tanxi, Zhang, Lili, Wang, Jifeng, Hao, Yajing, Zhang, Bao, Zhou, Yue, Guo, Xiaojing, Luo, Jianjun, Chen, Runsheng, Yang, Fuquan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society for Biochemistry and Molecular Biology 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8335655/
https://www.ncbi.nlm.nih.gov/pubmed/34129944
http://dx.doi.org/10.1016/j.mcpro.2021.100109
_version_ 1783733156353933312
author Zhang, Qing
Wu, Erzhong
Tang, Yiheng
Cai, Tanxi
Zhang, Lili
Wang, Jifeng
Hao, Yajing
Zhang, Bao
Zhou, Yue
Guo, Xiaojing
Luo, Jianjun
Chen, Runsheng
Yang, Fuquan
author_facet Zhang, Qing
Wu, Erzhong
Tang, Yiheng
Cai, Tanxi
Zhang, Lili
Wang, Jifeng
Hao, Yajing
Zhang, Bao
Zhou, Yue
Guo, Xiaojing
Luo, Jianjun
Chen, Runsheng
Yang, Fuquan
author_sort Zhang, Qing
collection PubMed
description Many small ORFs embedded in long noncoding RNA (lncRNA) transcripts have been shown to encode biologically functional polypeptides (small ORF-encoded polypeptides [SEPs]) in different organisms. Despite some novel SEPs have been found, the identification is still hampered by their poor predictability, diminutive size, and low relative abundance. Here, we take advantage of NONCODE, a repository containing the most complete collection and annotation of lncRNA transcripts from different species, to build a novel database that attempts to maximize a collection of SEPs from human and mouse lncRNA transcripts. In order to further improve SEP discovery, we implemented two effective and complementary polypeptide enrichment strategies using 30-kDa molecular weight cutoff filter and C8 solid-phase extraction column. These combined strategies enabled us to discover 353 SEPs from eight human cell lines and 409 SEPs from three mouse cell lines and eight mouse tissues. Importantly, 19 of them were then verified through in vitro expression, immunoblotting, parallel reaction monitoring, and synthetic peptides. Subsequent bioinformatics analysis revealed that some of the physical and chemical properties of these novel SEPs, including amino acid composition and codon usage, are different from those commonly found in canonical proteins. Intriguingly, nearly 65% of the identified SEPs were found to be initiated with non-AUG start codons. The 762 novel SEPs probably represent the largest number of SEPs detected by MS reported to date. These novel SEPs might not only provide new clues for the annotation of noncoding elements in the genome but also serve as a valuable resource for functional study.
format Online
Article
Text
id pubmed-8335655
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher American Society for Biochemistry and Molecular Biology
record_format MEDLINE/PubMed
spelling pubmed-83356552021-08-09 Deeply Mining a Universe of Peptides Encoded by Long Noncoding RNAs Zhang, Qing Wu, Erzhong Tang, Yiheng Cai, Tanxi Zhang, Lili Wang, Jifeng Hao, Yajing Zhang, Bao Zhou, Yue Guo, Xiaojing Luo, Jianjun Chen, Runsheng Yang, Fuquan Mol Cell Proteomics Research Many small ORFs embedded in long noncoding RNA (lncRNA) transcripts have been shown to encode biologically functional polypeptides (small ORF-encoded polypeptides [SEPs]) in different organisms. Despite some novel SEPs have been found, the identification is still hampered by their poor predictability, diminutive size, and low relative abundance. Here, we take advantage of NONCODE, a repository containing the most complete collection and annotation of lncRNA transcripts from different species, to build a novel database that attempts to maximize a collection of SEPs from human and mouse lncRNA transcripts. In order to further improve SEP discovery, we implemented two effective and complementary polypeptide enrichment strategies using 30-kDa molecular weight cutoff filter and C8 solid-phase extraction column. These combined strategies enabled us to discover 353 SEPs from eight human cell lines and 409 SEPs from three mouse cell lines and eight mouse tissues. Importantly, 19 of them were then verified through in vitro expression, immunoblotting, parallel reaction monitoring, and synthetic peptides. Subsequent bioinformatics analysis revealed that some of the physical and chemical properties of these novel SEPs, including amino acid composition and codon usage, are different from those commonly found in canonical proteins. Intriguingly, nearly 65% of the identified SEPs were found to be initiated with non-AUG start codons. The 762 novel SEPs probably represent the largest number of SEPs detected by MS reported to date. These novel SEPs might not only provide new clues for the annotation of noncoding elements in the genome but also serve as a valuable resource for functional study. American Society for Biochemistry and Molecular Biology 2021-06-12 /pmc/articles/PMC8335655/ /pubmed/34129944 http://dx.doi.org/10.1016/j.mcpro.2021.100109 Text en © 2021 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Research
Zhang, Qing
Wu, Erzhong
Tang, Yiheng
Cai, Tanxi
Zhang, Lili
Wang, Jifeng
Hao, Yajing
Zhang, Bao
Zhou, Yue
Guo, Xiaojing
Luo, Jianjun
Chen, Runsheng
Yang, Fuquan
Deeply Mining a Universe of Peptides Encoded by Long Noncoding RNAs
title Deeply Mining a Universe of Peptides Encoded by Long Noncoding RNAs
title_full Deeply Mining a Universe of Peptides Encoded by Long Noncoding RNAs
title_fullStr Deeply Mining a Universe of Peptides Encoded by Long Noncoding RNAs
title_full_unstemmed Deeply Mining a Universe of Peptides Encoded by Long Noncoding RNAs
title_short Deeply Mining a Universe of Peptides Encoded by Long Noncoding RNAs
title_sort deeply mining a universe of peptides encoded by long noncoding rnas
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8335655/
https://www.ncbi.nlm.nih.gov/pubmed/34129944
http://dx.doi.org/10.1016/j.mcpro.2021.100109
work_keys_str_mv AT zhangqing deeplyminingauniverseofpeptidesencodedbylongnoncodingrnas
AT wuerzhong deeplyminingauniverseofpeptidesencodedbylongnoncodingrnas
AT tangyiheng deeplyminingauniverseofpeptidesencodedbylongnoncodingrnas
AT caitanxi deeplyminingauniverseofpeptidesencodedbylongnoncodingrnas
AT zhanglili deeplyminingauniverseofpeptidesencodedbylongnoncodingrnas
AT wangjifeng deeplyminingauniverseofpeptidesencodedbylongnoncodingrnas
AT haoyajing deeplyminingauniverseofpeptidesencodedbylongnoncodingrnas
AT zhangbao deeplyminingauniverseofpeptidesencodedbylongnoncodingrnas
AT zhouyue deeplyminingauniverseofpeptidesencodedbylongnoncodingrnas
AT guoxiaojing deeplyminingauniverseofpeptidesencodedbylongnoncodingrnas
AT luojianjun deeplyminingauniverseofpeptidesencodedbylongnoncodingrnas
AT chenrunsheng deeplyminingauniverseofpeptidesencodedbylongnoncodingrnas
AT yangfuquan deeplyminingauniverseofpeptidesencodedbylongnoncodingrnas