Cargando…
Characterization and complexity of transcriptome in Gymnocypris przewalskii using single-molecule long-read sequencing and RNA-seq
The Tibetan Schizothoracinae fish Gymnocypris przewalskii has the ability to adapt to the extreme plateau environment, making it an ideal biological material for evolutionary biology research. However, the lack of well-annotated reference genomes has limited the study of the molecular genetics of G....
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8320875/ https://www.ncbi.nlm.nih.gov/pubmed/33989386 http://dx.doi.org/10.1093/dnares/dsab005 |
_version_ | 1783730718897078272 |
---|---|
author | Li, Xindan Wu, Jinming Xiao, Xinping Rong, Yifeng Yang, Haile Li, Junyi Zhou, Qiong Zhou, Weiguo Shi, Jianquan Qi, Hongfang Du, Hao |
author_facet | Li, Xindan Wu, Jinming Xiao, Xinping Rong, Yifeng Yang, Haile Li, Junyi Zhou, Qiong Zhou, Weiguo Shi, Jianquan Qi, Hongfang Du, Hao |
author_sort | Li, Xindan |
collection | PubMed |
description | The Tibetan Schizothoracinae fish Gymnocypris przewalskii has the ability to adapt to the extreme plateau environment, making it an ideal biological material for evolutionary biology research. However, the lack of well-annotated reference genomes has limited the study of the molecular genetics of G. przewalskii. To characterize its transcriptome features, we first used long-read sequencing technology in combination with RNA-seq for transcriptomic analysis. A total of 159,053 full-length (FL) transcripts were captured by Iso-Seq, having a mean length of 3,445 bp with N50 value of 4,348. Of all FL transcripts, 145,169 were well-annotated in the public database and 134,537 contained complete open reading frames. There were 4,149 pairs of alternative splicing events, of which three randomly selected were defined by RT–PCR and sequencing, and 13,293 long non-coding RNAs detected, based on all-vs.-all BLAST. A total of 118,185 perfect simple sequence repeats were identified from FL transcripts. The FL transcriptome might provide basis for further research of G. przewalskii. |
format | Online Article Text |
id | pubmed-8320875 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-83208752021-07-30 Characterization and complexity of transcriptome in Gymnocypris przewalskii using single-molecule long-read sequencing and RNA-seq Li, Xindan Wu, Jinming Xiao, Xinping Rong, Yifeng Yang, Haile Li, Junyi Zhou, Qiong Zhou, Weiguo Shi, Jianquan Qi, Hongfang Du, Hao DNA Res Resource Article: Genomes Explored The Tibetan Schizothoracinae fish Gymnocypris przewalskii has the ability to adapt to the extreme plateau environment, making it an ideal biological material for evolutionary biology research. However, the lack of well-annotated reference genomes has limited the study of the molecular genetics of G. przewalskii. To characterize its transcriptome features, we first used long-read sequencing technology in combination with RNA-seq for transcriptomic analysis. A total of 159,053 full-length (FL) transcripts were captured by Iso-Seq, having a mean length of 3,445 bp with N50 value of 4,348. Of all FL transcripts, 145,169 were well-annotated in the public database and 134,537 contained complete open reading frames. There were 4,149 pairs of alternative splicing events, of which three randomly selected were defined by RT–PCR and sequencing, and 13,293 long non-coding RNAs detected, based on all-vs.-all BLAST. A total of 118,185 perfect simple sequence repeats were identified from FL transcripts. The FL transcriptome might provide basis for further research of G. przewalskii. Oxford University Press 2021-05-14 /pmc/articles/PMC8320875/ /pubmed/33989386 http://dx.doi.org/10.1093/dnares/dsab005 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of Kazusa DNA Research Institute. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) ), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Resource Article: Genomes Explored Li, Xindan Wu, Jinming Xiao, Xinping Rong, Yifeng Yang, Haile Li, Junyi Zhou, Qiong Zhou, Weiguo Shi, Jianquan Qi, Hongfang Du, Hao Characterization and complexity of transcriptome in Gymnocypris przewalskii using single-molecule long-read sequencing and RNA-seq |
title | Characterization and complexity of transcriptome in Gymnocypris przewalskii using single-molecule long-read sequencing and RNA-seq |
title_full | Characterization and complexity of transcriptome in Gymnocypris przewalskii using single-molecule long-read sequencing and RNA-seq |
title_fullStr | Characterization and complexity of transcriptome in Gymnocypris przewalskii using single-molecule long-read sequencing and RNA-seq |
title_full_unstemmed | Characterization and complexity of transcriptome in Gymnocypris przewalskii using single-molecule long-read sequencing and RNA-seq |
title_short | Characterization and complexity of transcriptome in Gymnocypris przewalskii using single-molecule long-read sequencing and RNA-seq |
title_sort | characterization and complexity of transcriptome in gymnocypris przewalskii using single-molecule long-read sequencing and rna-seq |
topic | Resource Article: Genomes Explored |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8320875/ https://www.ncbi.nlm.nih.gov/pubmed/33989386 http://dx.doi.org/10.1093/dnares/dsab005 |
work_keys_str_mv | AT lixindan characterizationandcomplexityoftranscriptomeingymnocyprisprzewalskiiusingsinglemoleculelongreadsequencingandrnaseq AT wujinming characterizationandcomplexityoftranscriptomeingymnocyprisprzewalskiiusingsinglemoleculelongreadsequencingandrnaseq AT xiaoxinping characterizationandcomplexityoftranscriptomeingymnocyprisprzewalskiiusingsinglemoleculelongreadsequencingandrnaseq AT rongyifeng characterizationandcomplexityoftranscriptomeingymnocyprisprzewalskiiusingsinglemoleculelongreadsequencingandrnaseq AT yanghaile characterizationandcomplexityoftranscriptomeingymnocyprisprzewalskiiusingsinglemoleculelongreadsequencingandrnaseq AT lijunyi characterizationandcomplexityoftranscriptomeingymnocyprisprzewalskiiusingsinglemoleculelongreadsequencingandrnaseq AT zhouqiong characterizationandcomplexityoftranscriptomeingymnocyprisprzewalskiiusingsinglemoleculelongreadsequencingandrnaseq AT zhouweiguo characterizationandcomplexityoftranscriptomeingymnocyprisprzewalskiiusingsinglemoleculelongreadsequencingandrnaseq AT shijianquan characterizationandcomplexityoftranscriptomeingymnocyprisprzewalskiiusingsinglemoleculelongreadsequencingandrnaseq AT qihongfang characterizationandcomplexityoftranscriptomeingymnocyprisprzewalskiiusingsinglemoleculelongreadsequencingandrnaseq AT duhao characterizationandcomplexityoftranscriptomeingymnocyprisprzewalskiiusingsinglemoleculelongreadsequencingandrnaseq |