Cargando…

Genome-Wide Functional Analysis of the Cotton Transcriptome by Creating an Integrated EST Database

A total of 28,432 unique contigs (25,371 in consensus contigs and 3,061 as singletons) were assembled from all 268,786 cotton ESTs currently available. Several in silico approaches [comparative genomics, Blast, Gene Ontology (GO) analysis, and pathway enrichment by Kyoto Encyclopedia of Genes and Ge...

Descripción completa

Detalles Bibliográficos
Autores principales: Xie, Fuliang, Sun, Guiling, Stiller, John W., Zhang, Baohong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3210780/
https://www.ncbi.nlm.nih.gov/pubmed/22087239
http://dx.doi.org/10.1371/journal.pone.0026980
_version_ 1782215762498289664
author Xie, Fuliang
Sun, Guiling
Stiller, John W.
Zhang, Baohong
author_facet Xie, Fuliang
Sun, Guiling
Stiller, John W.
Zhang, Baohong
author_sort Xie, Fuliang
collection PubMed
description A total of 28,432 unique contigs (25,371 in consensus contigs and 3,061 as singletons) were assembled from all 268,786 cotton ESTs currently available. Several in silico approaches [comparative genomics, Blast, Gene Ontology (GO) analysis, and pathway enrichment by Kyoto Encyclopedia of Genes and Genomes (KEGG)] were employed to investigate global functions of the cotton transcriptome. Cotton EST contigs were clustered into 5,461 groups with a maximum cluster size of 196 members. A total of 27,956 indel mutants and 149,616 single nucleotide polymorphisms (SNPs) were identified from consensus contigs. Interestingly, many contigs with significantly high frequencies of indels or SNPs encode transcription factors and protein kinases. In a comparison with six model plant species, cotton ESTs show the highest overall similarity to grape. A total of 87 cotton miRNAs were identified; 59 of these have not been reported previously from experimental or bioinformatics investigations. We also predicted 3,260 genes as miRNAs targets, which are associated with multiple biological functions, including stress response, metabolism, hormone signal transduction and fiber development. We identified 151 and 4,214 EST-simple sequence repeats (SSRs) from contigs and raw ESTs respectively. To make these data widely available, and to facilitate access to EST-related genetic information, we integrated our results into a comprehensive, fully downloadable web-based cotton EST database (www.leonxie.com).
format Online
Article
Text
id pubmed-3210780
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-32107802011-11-15 Genome-Wide Functional Analysis of the Cotton Transcriptome by Creating an Integrated EST Database Xie, Fuliang Sun, Guiling Stiller, John W. Zhang, Baohong PLoS One Research Article A total of 28,432 unique contigs (25,371 in consensus contigs and 3,061 as singletons) were assembled from all 268,786 cotton ESTs currently available. Several in silico approaches [comparative genomics, Blast, Gene Ontology (GO) analysis, and pathway enrichment by Kyoto Encyclopedia of Genes and Genomes (KEGG)] were employed to investigate global functions of the cotton transcriptome. Cotton EST contigs were clustered into 5,461 groups with a maximum cluster size of 196 members. A total of 27,956 indel mutants and 149,616 single nucleotide polymorphisms (SNPs) were identified from consensus contigs. Interestingly, many contigs with significantly high frequencies of indels or SNPs encode transcription factors and protein kinases. In a comparison with six model plant species, cotton ESTs show the highest overall similarity to grape. A total of 87 cotton miRNAs were identified; 59 of these have not been reported previously from experimental or bioinformatics investigations. We also predicted 3,260 genes as miRNAs targets, which are associated with multiple biological functions, including stress response, metabolism, hormone signal transduction and fiber development. We identified 151 and 4,214 EST-simple sequence repeats (SSRs) from contigs and raw ESTs respectively. To make these data widely available, and to facilitate access to EST-related genetic information, we integrated our results into a comprehensive, fully downloadable web-based cotton EST database (www.leonxie.com). Public Library of Science 2011-11-08 /pmc/articles/PMC3210780/ /pubmed/22087239 http://dx.doi.org/10.1371/journal.pone.0026980 Text en Xie et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Xie, Fuliang
Sun, Guiling
Stiller, John W.
Zhang, Baohong
Genome-Wide Functional Analysis of the Cotton Transcriptome by Creating an Integrated EST Database
title Genome-Wide Functional Analysis of the Cotton Transcriptome by Creating an Integrated EST Database
title_full Genome-Wide Functional Analysis of the Cotton Transcriptome by Creating an Integrated EST Database
title_fullStr Genome-Wide Functional Analysis of the Cotton Transcriptome by Creating an Integrated EST Database
title_full_unstemmed Genome-Wide Functional Analysis of the Cotton Transcriptome by Creating an Integrated EST Database
title_short Genome-Wide Functional Analysis of the Cotton Transcriptome by Creating an Integrated EST Database
title_sort genome-wide functional analysis of the cotton transcriptome by creating an integrated est database
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3210780/
https://www.ncbi.nlm.nih.gov/pubmed/22087239
http://dx.doi.org/10.1371/journal.pone.0026980
work_keys_str_mv AT xiefuliang genomewidefunctionalanalysisofthecottontranscriptomebycreatinganintegratedestdatabase
AT sunguiling genomewidefunctionalanalysisofthecottontranscriptomebycreatinganintegratedestdatabase
AT stillerjohnw genomewidefunctionalanalysisofthecottontranscriptomebycreatinganintegratedestdatabase
AT zhangbaohong genomewidefunctionalanalysisofthecottontranscriptomebycreatinganintegratedestdatabase