
PacBio Single-Molecule Long-Read Sequencing Provides New Light on the Complexity of Full-Length Transcripts in Cattle


Bibliographic Details
Main Authors: Chang, Tianpeng, An, Bingxing, Liang, Mang, Duan, Xinghai, Du, Lili, Cai, Wentao, Zhu, Bo, Gao, Xue, Chen, Yan, Xu, Lingyang, Zhang, Lupei, Gao, Huijiang, Li, Junya
Format: Online Article Text
Language: English
Published: Frontiers Media S.A. 2021
Subjects: Genetics
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8437344/
https://www.ncbi.nlm.nih.gov/pubmed/34527015
http://dx.doi.org/10.3389/fgene.2021.664974
author Chang, Tianpeng
An, Bingxing
Liang, Mang
Duan, Xinghai
Du, Lili
Cai, Wentao
Zhu, Bo
Gao, Xue
Chen, Yan
Xu, Lingyang
Zhang, Lupei
Gao, Huijiang
Li, Junya
author_sort Chang, Tianpeng
collection PubMed
description Cattle (Bos taurus) are among the most widely distributed livestock species in the world and provide high-quality milk and meat, which strongly affect the quality of human life. Accurate and complete transcriptome and genome annotation is therefore of great value to cattle breeding research. In this study, we used error-corrected PacBio single-molecule real-time (SMRT) data to perform whole-transcriptome profiling in cattle. Sequencing generated 22.5 Gb of subreads, including 381,423 circular consensus sequences (CCSs), among which 276,295 full-length non-chimeric (FLNC) sequences were identified. After correction with Illumina short reads, we obtained 22,353 error-corrected isoforms. Transcriptome structural analysis detected 305 alternative splicing (AS) events and 3,795 alternative polyadenylation (APA) sites. Furthermore, we identified 457 novel genes, 120 putative transcription factors (TFs), and 569 novel long non-coding RNAs (lncRNAs). Taken together, these results provide new insights into the complexity of full-length transcripts in cattle.
format Online
Article
Text
id pubmed-8437344
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-8437344
2021-09-14
PacBio Single-Molecule Long-Read Sequencing Provides New Light on the Complexity of Full-Length Transcripts in Cattle
Chang, Tianpeng; An, Bingxing; Liang, Mang; Duan, Xinghai; Du, Lili; Cai, Wentao; Zhu, Bo; Gao, Xue; Chen, Yan; Xu, Lingyang; Zhang, Lupei; Gao, Huijiang; Li, Junya
Front Genet
Genetics
Frontiers Media S.A. 2021-08-30
/pmc/articles/PMC8437344/ /pubmed/34527015 http://dx.doi.org/10.3389/fgene.2021.664974
Text en
Copyright © 2021 Chang, An, Liang, Duan, Du, Cai, Zhu, Gao, Chen, Xu, Zhang, Gao and Li.
https://creativecommons.org/licenses/by/4.0/
This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
title PacBio Single-Molecule Long-Read Sequencing Provides New Light on the Complexity of Full-Length Transcripts in Cattle
topic Genetics