Cargando…

ORF Capture-Seq as a versatile method for targeted identification of full-length isoforms

Most human protein-coding genes are expressed as multiple isoforms, which greatly expands the functional repertoire of the encoded proteome. While at least one reliable open reading frame (ORF) model has been assigned for every coding gene, the majority of alternative isoforms remains uncharacterize...

Descripción completa

Detalles Bibliográficos
Autores principales: Sheynkman, Gloria M., Tuttle, Katharine S., Laval, Florent, Tseng, Elizabeth, Underwood, Jason G., Yu, Liang, Dong, Da, Smith, Melissa L., Sebra, Robert, Willems, Luc, Hao, Tong, Calderwood, Michael A., Hill, David E., Vidal, Marc
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7214433/
https://www.ncbi.nlm.nih.gov/pubmed/32393825
http://dx.doi.org/10.1038/s41467-020-16174-z
_version_ 1783531969825472512
author Sheynkman, Gloria M.
Tuttle, Katharine S.
Laval, Florent
Tseng, Elizabeth
Underwood, Jason G.
Yu, Liang
Dong, Da
Smith, Melissa L.
Sebra, Robert
Willems, Luc
Hao, Tong
Calderwood, Michael A.
Hill, David E.
Vidal, Marc
author_facet Sheynkman, Gloria M.
Tuttle, Katharine S.
Laval, Florent
Tseng, Elizabeth
Underwood, Jason G.
Yu, Liang
Dong, Da
Smith, Melissa L.
Sebra, Robert
Willems, Luc
Hao, Tong
Calderwood, Michael A.
Hill, David E.
Vidal, Marc
author_sort Sheynkman, Gloria M.
collection PubMed
description Most human protein-coding genes are expressed as multiple isoforms, which greatly expands the functional repertoire of the encoded proteome. While at least one reliable open reading frame (ORF) model has been assigned for every coding gene, the majority of alternative isoforms remains uncharacterized due to (i) vast differences of overall levels between different isoforms expressed from common genes, and (ii) the difficulty of obtaining full-length transcript sequences. Here, we present ORF Capture-Seq (OCS), a flexible method that addresses both challenges for targeted full-length isoform sequencing applications using collections of cloned ORFs as probes. As a proof-of-concept, we show that an OCS pipeline focused on genes coding for transcription factors increases isoform detection by an order of magnitude when compared to unenriched samples. In short, OCS enables rapid discovery of isoforms from custom-selected genes and will accelerate mapping of the human transcriptome.
format Online
Article
Text
id pubmed-7214433
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-72144332020-05-14 ORF Capture-Seq as a versatile method for targeted identification of full-length isoforms Sheynkman, Gloria M. Tuttle, Katharine S. Laval, Florent Tseng, Elizabeth Underwood, Jason G. Yu, Liang Dong, Da Smith, Melissa L. Sebra, Robert Willems, Luc Hao, Tong Calderwood, Michael A. Hill, David E. Vidal, Marc Nat Commun Article Most human protein-coding genes are expressed as multiple isoforms, which greatly expands the functional repertoire of the encoded proteome. While at least one reliable open reading frame (ORF) model has been assigned for every coding gene, the majority of alternative isoforms remains uncharacterized due to (i) vast differences of overall levels between different isoforms expressed from common genes, and (ii) the difficulty of obtaining full-length transcript sequences. Here, we present ORF Capture-Seq (OCS), a flexible method that addresses both challenges for targeted full-length isoform sequencing applications using collections of cloned ORFs as probes. As a proof-of-concept, we show that an OCS pipeline focused on genes coding for transcription factors increases isoform detection by an order of magnitude when compared to unenriched samples. In short, OCS enables rapid discovery of isoforms from custom-selected genes and will accelerate mapping of the human transcriptome. Nature Publishing Group UK 2020-05-11 /pmc/articles/PMC7214433/ /pubmed/32393825 http://dx.doi.org/10.1038/s41467-020-16174-z Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Sheynkman, Gloria M.
Tuttle, Katharine S.
Laval, Florent
Tseng, Elizabeth
Underwood, Jason G.
Yu, Liang
Dong, Da
Smith, Melissa L.
Sebra, Robert
Willems, Luc
Hao, Tong
Calderwood, Michael A.
Hill, David E.
Vidal, Marc
ORF Capture-Seq as a versatile method for targeted identification of full-length isoforms
title ORF Capture-Seq as a versatile method for targeted identification of full-length isoforms
title_full ORF Capture-Seq as a versatile method for targeted identification of full-length isoforms
title_fullStr ORF Capture-Seq as a versatile method for targeted identification of full-length isoforms
title_full_unstemmed ORF Capture-Seq as a versatile method for targeted identification of full-length isoforms
title_short ORF Capture-Seq as a versatile method for targeted identification of full-length isoforms
title_sort orf capture-seq as a versatile method for targeted identification of full-length isoforms
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7214433/
https://www.ncbi.nlm.nih.gov/pubmed/32393825
http://dx.doi.org/10.1038/s41467-020-16174-z
work_keys_str_mv AT sheynkmangloriam orfcaptureseqasaversatilemethodfortargetedidentificationoffulllengthisoforms
AT tuttlekatharines orfcaptureseqasaversatilemethodfortargetedidentificationoffulllengthisoforms
AT lavalflorent orfcaptureseqasaversatilemethodfortargetedidentificationoffulllengthisoforms
AT tsengelizabeth orfcaptureseqasaversatilemethodfortargetedidentificationoffulllengthisoforms
AT underwoodjasong orfcaptureseqasaversatilemethodfortargetedidentificationoffulllengthisoforms
AT yuliang orfcaptureseqasaversatilemethodfortargetedidentificationoffulllengthisoforms
AT dongda orfcaptureseqasaversatilemethodfortargetedidentificationoffulllengthisoforms
AT smithmelissal orfcaptureseqasaversatilemethodfortargetedidentificationoffulllengthisoforms
AT sebrarobert orfcaptureseqasaversatilemethodfortargetedidentificationoffulllengthisoforms
AT willemsluc orfcaptureseqasaversatilemethodfortargetedidentificationoffulllengthisoforms
AT haotong orfcaptureseqasaversatilemethodfortargetedidentificationoffulllengthisoforms
AT calderwoodmichaela orfcaptureseqasaversatilemethodfortargetedidentificationoffulllengthisoforms
AT hilldavide orfcaptureseqasaversatilemethodfortargetedidentificationoffulllengthisoforms
AT vidalmarc orfcaptureseqasaversatilemethodfortargetedidentificationoffulllengthisoforms