Cargando…

A Statistical Framework for Improving Genomic Annotations of Prokaryotic Essential Genes

Large-scale systematic analysis of gene essentiality is an important step closer toward unraveling the complex relationship between genotypes and phenotypes. Such analysis cannot be accomplished without unbiased and accurate annotations of essential genes. In current genomic databases, most of the e...

Descripción completa

Detalles Bibliográficos
Autores principales: Deng, Jingyuan, Su, Shengchang, Lin, Xiaodong, Hassett, Daniel J., Lu, Long Jason
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3592911/
https://www.ncbi.nlm.nih.gov/pubmed/23520492
http://dx.doi.org/10.1371/journal.pone.0058178
_version_ 1782262207083446272
author Deng, Jingyuan
Su, Shengchang
Lin, Xiaodong
Hassett, Daniel J.
Lu, Long Jason
author_facet Deng, Jingyuan
Su, Shengchang
Lin, Xiaodong
Hassett, Daniel J.
Lu, Long Jason
author_sort Deng, Jingyuan
collection PubMed
description Large-scale systematic analysis of gene essentiality is an important step closer toward unraveling the complex relationship between genotypes and phenotypes. Such analysis cannot be accomplished without unbiased and accurate annotations of essential genes. In current genomic databases, most of the essential gene annotations are derived from whole-genome transposon mutagenesis (TM), the most frequently used experimental approach for determining essential genes in microorganisms under defined conditions. However, there are substantial systematic biases associated with TM experiments. In this study, we developed a novel Poisson model–based statistical framework to simulate the TM insertion process and subsequently correct the experimental biases. We first quantitatively assessed the effects of major factors that potentially influence the accuracy of TM and subsequently incorporated relevant factors into the framework. Through iteratively optimizing parameters, we inferred the actual insertion events occurred and described each gene’s essentiality on probability measure. Evaluated by the definite mapping of essential gene profile in Escherichia coli, our model significantly improved the accuracy of original TM datasets, resulting in more accurate annotations of essential genes. Our method also showed encouraging results in improving subsaturation level TM datasets. To test our model’s broad applicability to other bacteria, we applied it to Pseudomonas aeruginosa PAO1 and Francisella tularensis novicida TM datasets. We validated our predictions by literature as well as allelic exchange experiments in PAO1. Our model was correct on six of the seven tested genes. Remarkably, among all three cases that our predictions contradicted the TM assignments, experimental validations supported our predictions. In summary, our method will be a promising tool in improving genomic annotations of essential genes and enabling large-scale explorations of gene essentiality. Our contribution is timely considering the rapidly increasing essential gene sets. A Webserver has been set up to provide convenient access to this tool. All results and source codes are available for download upon publication at http://research.cchmc.org/essentialgene/.
format Online
Article
Text
id pubmed-3592911
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-35929112013-03-21 A Statistical Framework for Improving Genomic Annotations of Prokaryotic Essential Genes Deng, Jingyuan Su, Shengchang Lin, Xiaodong Hassett, Daniel J. Lu, Long Jason PLoS One Research Article Large-scale systematic analysis of gene essentiality is an important step closer toward unraveling the complex relationship between genotypes and phenotypes. Such analysis cannot be accomplished without unbiased and accurate annotations of essential genes. In current genomic databases, most of the essential gene annotations are derived from whole-genome transposon mutagenesis (TM), the most frequently used experimental approach for determining essential genes in microorganisms under defined conditions. However, there are substantial systematic biases associated with TM experiments. In this study, we developed a novel Poisson model–based statistical framework to simulate the TM insertion process and subsequently correct the experimental biases. We first quantitatively assessed the effects of major factors that potentially influence the accuracy of TM and subsequently incorporated relevant factors into the framework. Through iteratively optimizing parameters, we inferred the actual insertion events occurred and described each gene’s essentiality on probability measure. Evaluated by the definite mapping of essential gene profile in Escherichia coli, our model significantly improved the accuracy of original TM datasets, resulting in more accurate annotations of essential genes. Our method also showed encouraging results in improving subsaturation level TM datasets. To test our model’s broad applicability to other bacteria, we applied it to Pseudomonas aeruginosa PAO1 and Francisella tularensis novicida TM datasets. We validated our predictions by literature as well as allelic exchange experiments in PAO1. Our model was correct on six of the seven tested genes. Remarkably, among all three cases that our predictions contradicted the TM assignments, experimental validations supported our predictions. In summary, our method will be a promising tool in improving genomic annotations of essential genes and enabling large-scale explorations of gene essentiality. Our contribution is timely considering the rapidly increasing essential gene sets. A Webserver has been set up to provide convenient access to this tool. All results and source codes are available for download upon publication at http://research.cchmc.org/essentialgene/. Public Library of Science 2013-03-08 /pmc/articles/PMC3592911/ /pubmed/23520492 http://dx.doi.org/10.1371/journal.pone.0058178 Text en © 2013 Deng et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Deng, Jingyuan
Su, Shengchang
Lin, Xiaodong
Hassett, Daniel J.
Lu, Long Jason
A Statistical Framework for Improving Genomic Annotations of Prokaryotic Essential Genes
title A Statistical Framework for Improving Genomic Annotations of Prokaryotic Essential Genes
title_full A Statistical Framework for Improving Genomic Annotations of Prokaryotic Essential Genes
title_fullStr A Statistical Framework for Improving Genomic Annotations of Prokaryotic Essential Genes
title_full_unstemmed A Statistical Framework for Improving Genomic Annotations of Prokaryotic Essential Genes
title_short A Statistical Framework for Improving Genomic Annotations of Prokaryotic Essential Genes
title_sort statistical framework for improving genomic annotations of prokaryotic essential genes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3592911/
https://www.ncbi.nlm.nih.gov/pubmed/23520492
http://dx.doi.org/10.1371/journal.pone.0058178
work_keys_str_mv AT dengjingyuan astatisticalframeworkforimprovinggenomicannotationsofprokaryoticessentialgenes
AT sushengchang astatisticalframeworkforimprovinggenomicannotationsofprokaryoticessentialgenes
AT linxiaodong astatisticalframeworkforimprovinggenomicannotationsofprokaryoticessentialgenes
AT hassettdanielj astatisticalframeworkforimprovinggenomicannotationsofprokaryoticessentialgenes
AT lulongjason astatisticalframeworkforimprovinggenomicannotationsofprokaryoticessentialgenes
AT dengjingyuan statisticalframeworkforimprovinggenomicannotationsofprokaryoticessentialgenes
AT sushengchang statisticalframeworkforimprovinggenomicannotationsofprokaryoticessentialgenes
AT linxiaodong statisticalframeworkforimprovinggenomicannotationsofprokaryoticessentialgenes
AT hassettdanielj statisticalframeworkforimprovinggenomicannotationsofprokaryoticessentialgenes
AT lulongjason statisticalframeworkforimprovinggenomicannotationsofprokaryoticessentialgenes