Cargando…
Identification and Characterization of miRNA Transcriptome in Asiatic Cotton (Gossypium arboreum) Using High Throughput Sequencing
MicroRNAs (miRNAs) are small 20–24nt molecules that have been well studied over the past decade due to their important regulatory roles in different cellular processes. The mature sequences are more conserved across vast phylogenetic scales than their precursors and some are conserved within entire...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5471329/ https://www.ncbi.nlm.nih.gov/pubmed/28663752 http://dx.doi.org/10.3389/fpls.2017.00969 |
_version_ | 1783243929262489600 |
---|---|
author | Farooq, Muhammad Mansoor, Shahid Guo, Hui Amin, Imran Chee, Peng W. Azim, M. Kamran Paterson, Andrew H. |
author_facet | Farooq, Muhammad Mansoor, Shahid Guo, Hui Amin, Imran Chee, Peng W. Azim, M. Kamran Paterson, Andrew H. |
author_sort | Farooq, Muhammad |
collection | PubMed |
description | MicroRNAs (miRNAs) are small 20–24nt molecules that have been well studied over the past decade due to their important regulatory roles in different cellular processes. The mature sequences are more conserved across vast phylogenetic scales than their precursors and some are conserved within entire kingdoms, hence, their loci and function can be predicted by homology searches. Different studies have been performed to elucidate miRNAs using de novo prediction methods but due to complex regulatory mechanisms or false positive in silico predictions, not all of them express in reality and sometimes computationally predicted mature transcripts differ from the actual expressed ones. With the availability of a complete genome sequence of Gossypium arboreum, it is important to annotate the genome for both coding and non-coding regions using high confidence transcript evidence, for this cotton species that is highly resistant to various biotic and abiotic stresses. Here we have analyzed the small RNA transcriptome of G. arboreum leaves and provided genome annotation of miRNAs with evidence from miRNA/miRNA(∗) transcripts. A total of 446 miRNAs clustered into 224 miRNA families were found, among which 48 families are conserved in other plants and 176 are novel. Four short RNA libraries were used to shortlist best predictions based on high reads per million. The size, origin, copy numbers and transcript depth of all miRNAs along with their isoforms and targets has been reported. The highest gene copy number was observed for gar-miR7504 followed by gar-miR166, gar-miR8771, gar-miR156, and gar-miR7484. Altogether, 1274 target genes were found in G. arboreum that are enriched for 216 KEGG pathways. The resultant genomic annotations are provided in UCSC, BED format. |
format | Online Article Text |
id | pubmed-5471329 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-54713292017-06-29 Identification and Characterization of miRNA Transcriptome in Asiatic Cotton (Gossypium arboreum) Using High Throughput Sequencing Farooq, Muhammad Mansoor, Shahid Guo, Hui Amin, Imran Chee, Peng W. Azim, M. Kamran Paterson, Andrew H. Front Plant Sci Plant Science MicroRNAs (miRNAs) are small 20–24nt molecules that have been well studied over the past decade due to their important regulatory roles in different cellular processes. The mature sequences are more conserved across vast phylogenetic scales than their precursors and some are conserved within entire kingdoms, hence, their loci and function can be predicted by homology searches. Different studies have been performed to elucidate miRNAs using de novo prediction methods but due to complex regulatory mechanisms or false positive in silico predictions, not all of them express in reality and sometimes computationally predicted mature transcripts differ from the actual expressed ones. With the availability of a complete genome sequence of Gossypium arboreum, it is important to annotate the genome for both coding and non-coding regions using high confidence transcript evidence, for this cotton species that is highly resistant to various biotic and abiotic stresses. Here we have analyzed the small RNA transcriptome of G. arboreum leaves and provided genome annotation of miRNAs with evidence from miRNA/miRNA(∗) transcripts. A total of 446 miRNAs clustered into 224 miRNA families were found, among which 48 families are conserved in other plants and 176 are novel. Four short RNA libraries were used to shortlist best predictions based on high reads per million. The size, origin, copy numbers and transcript depth of all miRNAs along with their isoforms and targets has been reported. The highest gene copy number was observed for gar-miR7504 followed by gar-miR166, gar-miR8771, gar-miR156, and gar-miR7484. Altogether, 1274 target genes were found in G. arboreum that are enriched for 216 KEGG pathways. The resultant genomic annotations are provided in UCSC, BED format. Frontiers Media S.A. 2017-06-15 /pmc/articles/PMC5471329/ /pubmed/28663752 http://dx.doi.org/10.3389/fpls.2017.00969 Text en Copyright © 2017 Farooq, Mansoor, Guo, Amin, Chee, Azim and Paterson. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Plant Science Farooq, Muhammad Mansoor, Shahid Guo, Hui Amin, Imran Chee, Peng W. Azim, M. Kamran Paterson, Andrew H. Identification and Characterization of miRNA Transcriptome in Asiatic Cotton (Gossypium arboreum) Using High Throughput Sequencing |
title | Identification and Characterization of miRNA Transcriptome in Asiatic Cotton (Gossypium arboreum) Using High Throughput Sequencing |
title_full | Identification and Characterization of miRNA Transcriptome in Asiatic Cotton (Gossypium arboreum) Using High Throughput Sequencing |
title_fullStr | Identification and Characterization of miRNA Transcriptome in Asiatic Cotton (Gossypium arboreum) Using High Throughput Sequencing |
title_full_unstemmed | Identification and Characterization of miRNA Transcriptome in Asiatic Cotton (Gossypium arboreum) Using High Throughput Sequencing |
title_short | Identification and Characterization of miRNA Transcriptome in Asiatic Cotton (Gossypium arboreum) Using High Throughput Sequencing |
title_sort | identification and characterization of mirna transcriptome in asiatic cotton (gossypium arboreum) using high throughput sequencing |
topic | Plant Science |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5471329/ https://www.ncbi.nlm.nih.gov/pubmed/28663752 http://dx.doi.org/10.3389/fpls.2017.00969 |
work_keys_str_mv | AT farooqmuhammad identificationandcharacterizationofmirnatranscriptomeinasiaticcottongossypiumarboreumusinghighthroughputsequencing AT mansoorshahid identificationandcharacterizationofmirnatranscriptomeinasiaticcottongossypiumarboreumusinghighthroughputsequencing AT guohui identificationandcharacterizationofmirnatranscriptomeinasiaticcottongossypiumarboreumusinghighthroughputsequencing AT aminimran identificationandcharacterizationofmirnatranscriptomeinasiaticcottongossypiumarboreumusinghighthroughputsequencing AT cheepengw identificationandcharacterizationofmirnatranscriptomeinasiaticcottongossypiumarboreumusinghighthroughputsequencing AT azimmkamran identificationandcharacterizationofmirnatranscriptomeinasiaticcottongossypiumarboreumusinghighthroughputsequencing AT patersonandrewh identificationandcharacterizationofmirnatranscriptomeinasiaticcottongossypiumarboreumusinghighthroughputsequencing |