Cargando…

Identification and Characterization of miRNA Transcriptome in Asiatic Cotton (Gossypium arboreum) Using High Throughput Sequencing

MicroRNAs (miRNAs) are small 20–24nt molecules that have been well studied over the past decade due to their important regulatory roles in different cellular processes. The mature sequences are more conserved across vast phylogenetic scales than their precursors and some are conserved within entire...

Descripción completa

Detalles Bibliográficos
Autores principales: Farooq, Muhammad, Mansoor, Shahid, Guo, Hui, Amin, Imran, Chee, Peng W., Azim, M. Kamran, Paterson, Andrew H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5471329/
https://www.ncbi.nlm.nih.gov/pubmed/28663752
http://dx.doi.org/10.3389/fpls.2017.00969
_version_ 1783243929262489600
author Farooq, Muhammad
Mansoor, Shahid
Guo, Hui
Amin, Imran
Chee, Peng W.
Azim, M. Kamran
Paterson, Andrew H.
author_facet Farooq, Muhammad
Mansoor, Shahid
Guo, Hui
Amin, Imran
Chee, Peng W.
Azim, M. Kamran
Paterson, Andrew H.
author_sort Farooq, Muhammad
collection PubMed
description MicroRNAs (miRNAs) are small 20–24nt molecules that have been well studied over the past decade due to their important regulatory roles in different cellular processes. The mature sequences are more conserved across vast phylogenetic scales than their precursors and some are conserved within entire kingdoms, hence, their loci and function can be predicted by homology searches. Different studies have been performed to elucidate miRNAs using de novo prediction methods but due to complex regulatory mechanisms or false positive in silico predictions, not all of them express in reality and sometimes computationally predicted mature transcripts differ from the actual expressed ones. With the availability of a complete genome sequence of Gossypium arboreum, it is important to annotate the genome for both coding and non-coding regions using high confidence transcript evidence, for this cotton species that is highly resistant to various biotic and abiotic stresses. Here we have analyzed the small RNA transcriptome of G. arboreum leaves and provided genome annotation of miRNAs with evidence from miRNA/miRNA(∗) transcripts. A total of 446 miRNAs clustered into 224 miRNA families were found, among which 48 families are conserved in other plants and 176 are novel. Four short RNA libraries were used to shortlist best predictions based on high reads per million. The size, origin, copy numbers and transcript depth of all miRNAs along with their isoforms and targets has been reported. The highest gene copy number was observed for gar-miR7504 followed by gar-miR166, gar-miR8771, gar-miR156, and gar-miR7484. Altogether, 1274 target genes were found in G. arboreum that are enriched for 216 KEGG pathways. The resultant genomic annotations are provided in UCSC, BED format.
format Online
Article
Text
id pubmed-5471329
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-54713292017-06-29 Identification and Characterization of miRNA Transcriptome in Asiatic Cotton (Gossypium arboreum) Using High Throughput Sequencing Farooq, Muhammad Mansoor, Shahid Guo, Hui Amin, Imran Chee, Peng W. Azim, M. Kamran Paterson, Andrew H. Front Plant Sci Plant Science MicroRNAs (miRNAs) are small 20–24nt molecules that have been well studied over the past decade due to their important regulatory roles in different cellular processes. The mature sequences are more conserved across vast phylogenetic scales than their precursors and some are conserved within entire kingdoms, hence, their loci and function can be predicted by homology searches. Different studies have been performed to elucidate miRNAs using de novo prediction methods but due to complex regulatory mechanisms or false positive in silico predictions, not all of them express in reality and sometimes computationally predicted mature transcripts differ from the actual expressed ones. With the availability of a complete genome sequence of Gossypium arboreum, it is important to annotate the genome for both coding and non-coding regions using high confidence transcript evidence, for this cotton species that is highly resistant to various biotic and abiotic stresses. Here we have analyzed the small RNA transcriptome of G. arboreum leaves and provided genome annotation of miRNAs with evidence from miRNA/miRNA(∗) transcripts. A total of 446 miRNAs clustered into 224 miRNA families were found, among which 48 families are conserved in other plants and 176 are novel. Four short RNA libraries were used to shortlist best predictions based on high reads per million. The size, origin, copy numbers and transcript depth of all miRNAs along with their isoforms and targets has been reported. The highest gene copy number was observed for gar-miR7504 followed by gar-miR166, gar-miR8771, gar-miR156, and gar-miR7484. Altogether, 1274 target genes were found in G. arboreum that are enriched for 216 KEGG pathways. The resultant genomic annotations are provided in UCSC, BED format. Frontiers Media S.A. 2017-06-15 /pmc/articles/PMC5471329/ /pubmed/28663752 http://dx.doi.org/10.3389/fpls.2017.00969 Text en Copyright © 2017 Farooq, Mansoor, Guo, Amin, Chee, Azim and Paterson. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Plant Science
Farooq, Muhammad
Mansoor, Shahid
Guo, Hui
Amin, Imran
Chee, Peng W.
Azim, M. Kamran
Paterson, Andrew H.
Identification and Characterization of miRNA Transcriptome in Asiatic Cotton (Gossypium arboreum) Using High Throughput Sequencing
title Identification and Characterization of miRNA Transcriptome in Asiatic Cotton (Gossypium arboreum) Using High Throughput Sequencing
title_full Identification and Characterization of miRNA Transcriptome in Asiatic Cotton (Gossypium arboreum) Using High Throughput Sequencing
title_fullStr Identification and Characterization of miRNA Transcriptome in Asiatic Cotton (Gossypium arboreum) Using High Throughput Sequencing
title_full_unstemmed Identification and Characterization of miRNA Transcriptome in Asiatic Cotton (Gossypium arboreum) Using High Throughput Sequencing
title_short Identification and Characterization of miRNA Transcriptome in Asiatic Cotton (Gossypium arboreum) Using High Throughput Sequencing
title_sort identification and characterization of mirna transcriptome in asiatic cotton (gossypium arboreum) using high throughput sequencing
topic Plant Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5471329/
https://www.ncbi.nlm.nih.gov/pubmed/28663752
http://dx.doi.org/10.3389/fpls.2017.00969
work_keys_str_mv AT farooqmuhammad identificationandcharacterizationofmirnatranscriptomeinasiaticcottongossypiumarboreumusinghighthroughputsequencing
AT mansoorshahid identificationandcharacterizationofmirnatranscriptomeinasiaticcottongossypiumarboreumusinghighthroughputsequencing
AT guohui identificationandcharacterizationofmirnatranscriptomeinasiaticcottongossypiumarboreumusinghighthroughputsequencing
AT aminimran identificationandcharacterizationofmirnatranscriptomeinasiaticcottongossypiumarboreumusinghighthroughputsequencing
AT cheepengw identificationandcharacterizationofmirnatranscriptomeinasiaticcottongossypiumarboreumusinghighthroughputsequencing
AT azimmkamran identificationandcharacterizationofmirnatranscriptomeinasiaticcottongossypiumarboreumusinghighthroughputsequencing
AT patersonandrewh identificationandcharacterizationofmirnatranscriptomeinasiaticcottongossypiumarboreumusinghighthroughputsequencing