Cargando…
Long-read sequencing of the human cytomegalovirus transcriptome with the Pacific Biosciences RSII platform
Long-read RNA sequencing allows for the precise characterization of full-length transcripts, which makes it an indispensable tool in transcriptomics. The human cytomegalovirus (HCMV) genome has been first sequenced in 1989 and although short-read sequencing studies have uncovered much of the complex...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5735922/ https://www.ncbi.nlm.nih.gov/pubmed/29257134 http://dx.doi.org/10.1038/sdata.2017.194 |
_version_ | 1783287294356094976 |
---|---|
author | Balázs, Zsolt Tombácz, Dóra Szűcs, Attila Snyder, Michael Boldogkői, Zsolt |
author_facet | Balázs, Zsolt Tombácz, Dóra Szűcs, Attila Snyder, Michael Boldogkői, Zsolt |
author_sort | Balázs, Zsolt |
collection | PubMed |
description | Long-read RNA sequencing allows for the precise characterization of full-length transcripts, which makes it an indispensable tool in transcriptomics. The human cytomegalovirus (HCMV) genome has been first sequenced in 1989 and although short-read sequencing studies have uncovered much of the complexity of its transcriptome, only few of its transcripts have been fully annotated. We hereby present a long-read RNA sequencing dataset of HCMV infected human lung fibroblast cells sequenced by the Pacific Biosciences RSII platform. Seven SMRT cells were sequenced using oligo(dT) primers to reverse transcribe poly(A)-selected RNA molecules and one library was prepared using random primers for the reverse transcription of the rRNA-depleted sample. Our dataset contains 122,636 human and 33,086 viral (HMCV strain Towne) reads. The described data include raw and processed sequencing files, and combined with other datasets, they can be used to validate transcriptome analysis tools, to compare library preparation methods, to test base calling algorithms or to identify genetic variants. |
format | Online Article Text |
id | pubmed-5735922 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-57359222017-12-21 Long-read sequencing of the human cytomegalovirus transcriptome with the Pacific Biosciences RSII platform Balázs, Zsolt Tombácz, Dóra Szűcs, Attila Snyder, Michael Boldogkői, Zsolt Sci Data Data Descriptor Long-read RNA sequencing allows for the precise characterization of full-length transcripts, which makes it an indispensable tool in transcriptomics. The human cytomegalovirus (HCMV) genome has been first sequenced in 1989 and although short-read sequencing studies have uncovered much of the complexity of its transcriptome, only few of its transcripts have been fully annotated. We hereby present a long-read RNA sequencing dataset of HCMV infected human lung fibroblast cells sequenced by the Pacific Biosciences RSII platform. Seven SMRT cells were sequenced using oligo(dT) primers to reverse transcribe poly(A)-selected RNA molecules and one library was prepared using random primers for the reverse transcription of the rRNA-depleted sample. Our dataset contains 122,636 human and 33,086 viral (HMCV strain Towne) reads. The described data include raw and processed sequencing files, and combined with other datasets, they can be used to validate transcriptome analysis tools, to compare library preparation methods, to test base calling algorithms or to identify genetic variants. Nature Publishing Group 2017-12-19 /pmc/articles/PMC5735922/ /pubmed/29257134 http://dx.doi.org/10.1038/sdata.2017.194 Text en Copyright © 2017, The Author(s) http://creativecommons.org/licenses/by/4.0/ Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files made available in this article. |
spellingShingle | Data Descriptor Balázs, Zsolt Tombácz, Dóra Szűcs, Attila Snyder, Michael Boldogkői, Zsolt Long-read sequencing of the human cytomegalovirus transcriptome with the Pacific Biosciences RSII platform |
title | Long-read sequencing of the human cytomegalovirus transcriptome with the Pacific Biosciences RSII platform |
title_full | Long-read sequencing of the human cytomegalovirus transcriptome with the Pacific Biosciences RSII platform |
title_fullStr | Long-read sequencing of the human cytomegalovirus transcriptome with the Pacific Biosciences RSII platform |
title_full_unstemmed | Long-read sequencing of the human cytomegalovirus transcriptome with the Pacific Biosciences RSII platform |
title_short | Long-read sequencing of the human cytomegalovirus transcriptome with the Pacific Biosciences RSII platform |
title_sort | long-read sequencing of the human cytomegalovirus transcriptome with the pacific biosciences rsii platform |
topic | Data Descriptor |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5735922/ https://www.ncbi.nlm.nih.gov/pubmed/29257134 http://dx.doi.org/10.1038/sdata.2017.194 |
work_keys_str_mv | AT balazszsolt longreadsequencingofthehumancytomegalovirustranscriptomewiththepacificbiosciencesrsiiplatform AT tombaczdora longreadsequencingofthehumancytomegalovirustranscriptomewiththepacificbiosciencesrsiiplatform AT szucsattila longreadsequencingofthehumancytomegalovirustranscriptomewiththepacificbiosciencesrsiiplatform AT snydermichael longreadsequencingofthehumancytomegalovirustranscriptomewiththepacificbiosciencesrsiiplatform AT boldogkoizsolt longreadsequencingofthehumancytomegalovirustranscriptomewiththepacificbiosciencesrsiiplatform |