Cargando…
ARTADE2DB: Improved Statistical Inferences for Arabidopsis Gene Functions and Structure Predictions by Dynamic Structure-Based Dynamic Expression (DSDE) Analyses
Recent advances in technologies for observing high-resolution genomic activities, such as whole-genome tiling arrays and high-throughput sequencers, provide detailed information for understanding genome functions. However, the functions of 50% of known Arabidopsis thaliana genes remain unknown or ar...
Autores principales: | , , , , , , , , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3037080/ https://www.ncbi.nlm.nih.gov/pubmed/21227933 http://dx.doi.org/10.1093/pcp/pcq202 |
_version_ | 1782197949960290304 |
---|---|
author | Iida, Kei Kawaguchi, Shuji Kobayashi, Norio Yoshida, Yuko Ishii, Manabu Harada, Erimi Hanada, Kousuke Matsui, Akihiro Okamoto, Masanori Ishida, Junko Tanaka, Maho Morosawa, Taeko Seki, Motoaki Toyoda, Tetsuro |
author_facet | Iida, Kei Kawaguchi, Shuji Kobayashi, Norio Yoshida, Yuko Ishii, Manabu Harada, Erimi Hanada, Kousuke Matsui, Akihiro Okamoto, Masanori Ishida, Junko Tanaka, Maho Morosawa, Taeko Seki, Motoaki Toyoda, Tetsuro |
author_sort | Iida, Kei |
collection | PubMed |
description | Recent advances in technologies for observing high-resolution genomic activities, such as whole-genome tiling arrays and high-throughput sequencers, provide detailed information for understanding genome functions. However, the functions of 50% of known Arabidopsis thaliana genes remain unknown or are annotated only on the basis of static analyses such as protein motifs or similarities. In this paper, we describe dynamic structure-based dynamic expression (DSDE) analysis, which sequentially predicts both structural and functional features of transcripts. We show that DSDE analysis inferred gene functions 12% more precisely than static structure-based dynamic expression (SSDE) analysis or conventional co-expression analysis based on previously determined gene structures of A. thaliana. This result suggests that more precise structural information than the fixed conventional annotated structures is crucial for co-expression analysis in systems biology of transcriptional regulation and dynamics. Our DSDE method, ARabidopsis Tiling-Array-based Detection of Exons version 2 and over-representation analysis (ARTADE2-ORA), precisely predicts each gene structure by combining two statistical analyses: a probe-wise co-expression analysis of multiple transcriptome measurements and a Markov model analysis of genome sequences. ARTADE2-ORA successfully identified the true functions of about 90% of functionally annotated genes, inferred the functions of 98% of functionally unknown genes and predicted 1,489 new gene structures and functions. We developed a database ARTADE2DB that integrates not only the information predicted by ARTADE2-ORA but also annotations and other functional information, such as phenotypes and literature citations, and is expected to contribute to the study of the functional genomics of A. thaliana. URL: http://artade.org. |
format | Text |
id | pubmed-3037080 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-30370802011-02-10 ARTADE2DB: Improved Statistical Inferences for Arabidopsis Gene Functions and Structure Predictions by Dynamic Structure-Based Dynamic Expression (DSDE) Analyses Iida, Kei Kawaguchi, Shuji Kobayashi, Norio Yoshida, Yuko Ishii, Manabu Harada, Erimi Hanada, Kousuke Matsui, Akihiro Okamoto, Masanori Ishida, Junko Tanaka, Maho Morosawa, Taeko Seki, Motoaki Toyoda, Tetsuro Plant Cell Physiol Special Issue – Databases Recent advances in technologies for observing high-resolution genomic activities, such as whole-genome tiling arrays and high-throughput sequencers, provide detailed information for understanding genome functions. However, the functions of 50% of known Arabidopsis thaliana genes remain unknown or are annotated only on the basis of static analyses such as protein motifs or similarities. In this paper, we describe dynamic structure-based dynamic expression (DSDE) analysis, which sequentially predicts both structural and functional features of transcripts. We show that DSDE analysis inferred gene functions 12% more precisely than static structure-based dynamic expression (SSDE) analysis or conventional co-expression analysis based on previously determined gene structures of A. thaliana. This result suggests that more precise structural information than the fixed conventional annotated structures is crucial for co-expression analysis in systems biology of transcriptional regulation and dynamics. Our DSDE method, ARabidopsis Tiling-Array-based Detection of Exons version 2 and over-representation analysis (ARTADE2-ORA), precisely predicts each gene structure by combining two statistical analyses: a probe-wise co-expression analysis of multiple transcriptome measurements and a Markov model analysis of genome sequences. ARTADE2-ORA successfully identified the true functions of about 90% of functionally annotated genes, inferred the functions of 98% of functionally unknown genes and predicted 1,489 new gene structures and functions. We developed a database ARTADE2DB that integrates not only the information predicted by ARTADE2-ORA but also annotations and other functional information, such as phenotypes and literature citations, and is expected to contribute to the study of the functional genomics of A. thaliana. URL: http://artade.org. Oxford University Press 2011-02 2011-01-12 /pmc/articles/PMC3037080/ /pubmed/21227933 http://dx.doi.org/10.1093/pcp/pcq202 Text en © The Author 2011. Published by Oxford University Press on behalf of Japanese Society of Plant Physiologists. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Special Issue – Databases Iida, Kei Kawaguchi, Shuji Kobayashi, Norio Yoshida, Yuko Ishii, Manabu Harada, Erimi Hanada, Kousuke Matsui, Akihiro Okamoto, Masanori Ishida, Junko Tanaka, Maho Morosawa, Taeko Seki, Motoaki Toyoda, Tetsuro ARTADE2DB: Improved Statistical Inferences for Arabidopsis Gene Functions and Structure Predictions by Dynamic Structure-Based Dynamic Expression (DSDE) Analyses |
title | ARTADE2DB: Improved Statistical Inferences for Arabidopsis Gene Functions and Structure Predictions by Dynamic Structure-Based Dynamic Expression (DSDE) Analyses |
title_full | ARTADE2DB: Improved Statistical Inferences for Arabidopsis Gene Functions and Structure Predictions by Dynamic Structure-Based Dynamic Expression (DSDE) Analyses |
title_fullStr | ARTADE2DB: Improved Statistical Inferences for Arabidopsis Gene Functions and Structure Predictions by Dynamic Structure-Based Dynamic Expression (DSDE) Analyses |
title_full_unstemmed | ARTADE2DB: Improved Statistical Inferences for Arabidopsis Gene Functions and Structure Predictions by Dynamic Structure-Based Dynamic Expression (DSDE) Analyses |
title_short | ARTADE2DB: Improved Statistical Inferences for Arabidopsis Gene Functions and Structure Predictions by Dynamic Structure-Based Dynamic Expression (DSDE) Analyses |
title_sort | artade2db: improved statistical inferences for arabidopsis gene functions and structure predictions by dynamic structure-based dynamic expression (dsde) analyses |
topic | Special Issue – Databases |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3037080/ https://www.ncbi.nlm.nih.gov/pubmed/21227933 http://dx.doi.org/10.1093/pcp/pcq202 |
work_keys_str_mv | AT iidakei artade2dbimprovedstatisticalinferencesforarabidopsisgenefunctionsandstructurepredictionsbydynamicstructurebaseddynamicexpressiondsdeanalyses AT kawaguchishuji artade2dbimprovedstatisticalinferencesforarabidopsisgenefunctionsandstructurepredictionsbydynamicstructurebaseddynamicexpressiondsdeanalyses AT kobayashinorio artade2dbimprovedstatisticalinferencesforarabidopsisgenefunctionsandstructurepredictionsbydynamicstructurebaseddynamicexpressiondsdeanalyses AT yoshidayuko artade2dbimprovedstatisticalinferencesforarabidopsisgenefunctionsandstructurepredictionsbydynamicstructurebaseddynamicexpressiondsdeanalyses AT ishiimanabu artade2dbimprovedstatisticalinferencesforarabidopsisgenefunctionsandstructurepredictionsbydynamicstructurebaseddynamicexpressiondsdeanalyses AT haradaerimi artade2dbimprovedstatisticalinferencesforarabidopsisgenefunctionsandstructurepredictionsbydynamicstructurebaseddynamicexpressiondsdeanalyses AT hanadakousuke artade2dbimprovedstatisticalinferencesforarabidopsisgenefunctionsandstructurepredictionsbydynamicstructurebaseddynamicexpressiondsdeanalyses AT matsuiakihiro artade2dbimprovedstatisticalinferencesforarabidopsisgenefunctionsandstructurepredictionsbydynamicstructurebaseddynamicexpressiondsdeanalyses AT okamotomasanori artade2dbimprovedstatisticalinferencesforarabidopsisgenefunctionsandstructurepredictionsbydynamicstructurebaseddynamicexpressiondsdeanalyses AT ishidajunko artade2dbimprovedstatisticalinferencesforarabidopsisgenefunctionsandstructurepredictionsbydynamicstructurebaseddynamicexpressiondsdeanalyses AT tanakamaho artade2dbimprovedstatisticalinferencesforarabidopsisgenefunctionsandstructurepredictionsbydynamicstructurebaseddynamicexpressiondsdeanalyses AT morosawataeko artade2dbimprovedstatisticalinferencesforarabidopsisgenefunctionsandstructurepredictionsbydynamicstructurebaseddynamicexpressiondsdeanalyses AT sekimotoaki artade2dbimprovedstatisticalinferencesforarabidopsisgenefunctionsandstructurepredictionsbydynamicstructurebaseddynamicexpressiondsdeanalyses AT toyodatetsuro artade2dbimprovedstatisticalinferencesforarabidopsisgenefunctionsandstructurepredictionsbydynamicstructurebaseddynamicexpressiondsdeanalyses |