PacBio Single-Molecule Long-Read Sequencing Provides New Light on the Complexity of Full-Length Transcripts in Cattle
Cattle (Bos taurus) is one of the most widely distributed livestock species in the world, and provides us with high-quality milk and meat which have a huge impact on the quality of human life. Therefore, accurate and complete transcriptome and genome annotation are of great value to the research of...
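Main authors: | Chang, Tianpeng; An, Bingxing; Liang, Mang; Duan, Xinghai; Du, Lili; Cai, Wentao; Zhu, Bo; Gao, Xue; Chen, Yan; Xu, Lingyang; Zhang, Lupei; Gao, Huijiang; Li, Junya |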
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Frontiers Media S.A. 2021 |
Subjects: | Genetics |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8437344/ https://www.ncbi.nlm.nih.gov/pubmed/34527015 http://dx.doi.org/10.3389/fgene.2021.664974 |
_version_ | 1783752152757305344 |
---|---|
author | Chang, Tianpeng An, Bingxing Liang, Mang Duan, Xinghai Du, Lili Cai, Wentao Zhu, Bo Gao, Xue Chen, Yan Xu, Lingyang Zhang, Lupei Gao, Huijiang Li, Junya |
author_facet | Chang, Tianpeng An, Bingxing Liang, Mang Duan, Xinghai Du, Lili Cai, Wentao Zhu, Bo Gao, Xue Chen, Yan Xu, Lingyang Zhang, Lupei Gao, Huijiang Li, Junya |
author_sort | Chang, Tianpeng |
collection | PubMed |
description | Cattle (Bos taurus) is one of the most widely distributed livestock species in the world, and provides us with high-quality milk and meat which have a huge impact on the quality of human life. Therefore, accurate and complete transcriptome and genome annotation are of great value to the research of cattle breeding. In this study, we used error-corrected PacBio single-molecule real-time (SMRT) data to perform whole-transcriptome profiling in cattle. Then, 22.5 Gb of subreads was generated, including 381,423 circular consensus sequences (CCSs), among which 276,295 full-length non-chimeric (FLNC) sequences were identified. After correction by Illumina short reads, we obtained 22,353 error-corrected isoforms. A total of 305 alternative splicing (AS) events and 3,795 alternative polyadenylation (APA) sites were detected by transcriptome structural analysis. Furthermore, we identified 457 novel genes, 120 putative transcription factors (TFs), and 569 novel long non-coding RNAs (lncRNAs). Taken together, this research improves our understanding and provides new insights into the complexity of full-length transcripts in cattle. |
format | Online Article Text |
id | pubmed-8437344 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-8437344 2021-09-14 PacBio Single-Molecule Long-Read Sequencing Provides New Light on the Complexity of Full-Length Transcripts in Cattle Chang, Tianpeng An, Bingxing Liang, Mang Duan, Xinghai Du, Lili Cai, Wentao Zhu, Bo Gao, Xue Chen, Yan Xu, Lingyang Zhang, Lupei Gao, Huijiang Li, Junya Front Genet Genetics Cattle (Bos taurus) is one of the most widely distributed livestock species in the world, and provides us with high-quality milk and meat which have a huge impact on the quality of human life. Therefore, accurate and complete transcriptome and genome annotation are of great value to the research of cattle breeding. In this study, we used error-corrected PacBio single-molecule real-time (SMRT) data to perform whole-transcriptome profiling in cattle. Then, 22.5 Gb of subreads was generated, including 381,423 circular consensus sequences (CCSs), among which 276,295 full-length non-chimeric (FLNC) sequences were identified. After correction by Illumina short reads, we obtained 22,353 error-corrected isoforms. A total of 305 alternative splicing (AS) events and 3,795 alternative polyadenylation (APA) sites were detected by transcriptome structural analysis. Furthermore, we identified 457 novel genes, 120 putative transcription factors (TFs), and 569 novel long non-coding RNAs (lncRNAs). Taken together, this research improves our understanding and provides new insights into the complexity of full-length transcripts in cattle. Frontiers Media S.A. 2021-08-30 /pmc/articles/PMC8437344/ /pubmed/34527015 http://dx.doi.org/10.3389/fgene.2021.664974 Text en Copyright © 2021 Chang, An, Liang, Duan, Du, Cai, Zhu, Gao, Chen, Xu, Zhang, Gao and Li. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Chang, Tianpeng An, Bingxing Liang, Mang Duan, Xinghai Du, Lili Cai, Wentao Zhu, Bo Gao, Xue Chen, Yan Xu, Lingyang Zhang, Lupei Gao, Huijiang Li, Junya PacBio Single-Molecule Long-Read Sequencing Provides New Light on the Complexity of Full-Length Transcripts in Cattle |
title | PacBio Single-Molecule Long-Read Sequencing Provides New Light on the Complexity of Full-Length Transcripts in Cattle |
title_full | PacBio Single-Molecule Long-Read Sequencing Provides New Light on the Complexity of Full-Length Transcripts in Cattle |
title_fullStr | PacBio Single-Molecule Long-Read Sequencing Provides New Light on the Complexity of Full-Length Transcripts in Cattle |
title_full_unstemmed | PacBio Single-Molecule Long-Read Sequencing Provides New Light on the Complexity of Full-Length Transcripts in Cattle |
title_short | PacBio Single-Molecule Long-Read Sequencing Provides New Light on the Complexity of Full-Length Transcripts in Cattle |
title_sort | pacbio single-molecule long-read sequencing provides new light on the complexity of full-length transcripts in cattle |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8437344/ https://www.ncbi.nlm.nih.gov/pubmed/34527015 http://dx.doi.org/10.3389/fgene.2021.664974 |