Cargando…

Long-read sequencing of the human cytomegalovirus transcriptome with the Pacific Biosciences RSII platform

Long-read RNA sequencing allows for the precise characterization of full-length transcripts, which makes it an indispensable tool in transcriptomics. The human cytomegalovirus (HCMV) genome has been first sequenced in 1989 and although short-read sequencing studies have uncovered much of the complex...

Descripción completa

Detalles Bibliográficos
Autores principales: Balázs, Zsolt, Tombácz, Dóra, Szűcs, Attila, Snyder, Michael, Boldogkői, Zsolt
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5735922/
https://www.ncbi.nlm.nih.gov/pubmed/29257134
http://dx.doi.org/10.1038/sdata.2017.194
_version_ 1783287294356094976
author Balázs, Zsolt
Tombácz, Dóra
Szűcs, Attila
Snyder, Michael
Boldogkői, Zsolt
author_facet Balázs, Zsolt
Tombácz, Dóra
Szűcs, Attila
Snyder, Michael
Boldogkői, Zsolt
author_sort Balázs, Zsolt
collection PubMed
description Long-read RNA sequencing allows for the precise characterization of full-length transcripts, which makes it an indispensable tool in transcriptomics. The human cytomegalovirus (HCMV) genome has been first sequenced in 1989 and although short-read sequencing studies have uncovered much of the complexity of its transcriptome, only few of its transcripts have been fully annotated. We hereby present a long-read RNA sequencing dataset of HCMV infected human lung fibroblast cells sequenced by the Pacific Biosciences RSII platform. Seven SMRT cells were sequenced using oligo(dT) primers to reverse transcribe poly(A)-selected RNA molecules and one library was prepared using random primers for the reverse transcription of the rRNA-depleted sample. Our dataset contains 122,636 human and 33,086 viral (HMCV strain Towne) reads. The described data include raw and processed sequencing files, and combined with other datasets, they can be used to validate transcriptome analysis tools, to compare library preparation methods, to test base calling algorithms or to identify genetic variants.
format Online
Article
Text
id pubmed-5735922
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-57359222017-12-21 Long-read sequencing of the human cytomegalovirus transcriptome with the Pacific Biosciences RSII platform Balázs, Zsolt Tombácz, Dóra Szűcs, Attila Snyder, Michael Boldogkői, Zsolt Sci Data Data Descriptor Long-read RNA sequencing allows for the precise characterization of full-length transcripts, which makes it an indispensable tool in transcriptomics. The human cytomegalovirus (HCMV) genome has been first sequenced in 1989 and although short-read sequencing studies have uncovered much of the complexity of its transcriptome, only few of its transcripts have been fully annotated. We hereby present a long-read RNA sequencing dataset of HCMV infected human lung fibroblast cells sequenced by the Pacific Biosciences RSII platform. Seven SMRT cells were sequenced using oligo(dT) primers to reverse transcribe poly(A)-selected RNA molecules and one library was prepared using random primers for the reverse transcription of the rRNA-depleted sample. Our dataset contains 122,636 human and 33,086 viral (HMCV strain Towne) reads. The described data include raw and processed sequencing files, and combined with other datasets, they can be used to validate transcriptome analysis tools, to compare library preparation methods, to test base calling algorithms or to identify genetic variants. Nature Publishing Group 2017-12-19 /pmc/articles/PMC5735922/ /pubmed/29257134 http://dx.doi.org/10.1038/sdata.2017.194 Text en Copyright © 2017, The Author(s) http://creativecommons.org/licenses/by/4.0/ Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files made available in this article.
spellingShingle Data Descriptor
Balázs, Zsolt
Tombácz, Dóra
Szűcs, Attila
Snyder, Michael
Boldogkői, Zsolt
Long-read sequencing of the human cytomegalovirus transcriptome with the Pacific Biosciences RSII platform
title Long-read sequencing of the human cytomegalovirus transcriptome with the Pacific Biosciences RSII platform
title_full Long-read sequencing of the human cytomegalovirus transcriptome with the Pacific Biosciences RSII platform
title_fullStr Long-read sequencing of the human cytomegalovirus transcriptome with the Pacific Biosciences RSII platform
title_full_unstemmed Long-read sequencing of the human cytomegalovirus transcriptome with the Pacific Biosciences RSII platform
title_short Long-read sequencing of the human cytomegalovirus transcriptome with the Pacific Biosciences RSII platform
title_sort long-read sequencing of the human cytomegalovirus transcriptome with the pacific biosciences rsii platform
topic Data Descriptor
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5735922/
https://www.ncbi.nlm.nih.gov/pubmed/29257134
http://dx.doi.org/10.1038/sdata.2017.194
work_keys_str_mv AT balazszsolt longreadsequencingofthehumancytomegalovirustranscriptomewiththepacificbiosciencesrsiiplatform
AT tombaczdora longreadsequencingofthehumancytomegalovirustranscriptomewiththepacificbiosciencesrsiiplatform
AT szucsattila longreadsequencingofthehumancytomegalovirustranscriptomewiththepacificbiosciencesrsiiplatform
AT snydermichael longreadsequencingofthehumancytomegalovirustranscriptomewiththepacificbiosciencesrsiiplatform
AT boldogkoizsolt longreadsequencingofthehumancytomegalovirustranscriptomewiththepacificbiosciencesrsiiplatform