Cargando…

Insights from the reanalysis of high-throughput chemical genomics data for Escherichia coli K-12

Despite the demonstrated success of genome-wide genetic screens and chemical genomics studies at predicting functions for genes of unknown function or predicting new functions for well-characterized genes, their potential to provide insights into gene function has not been fully explored. We systema...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Peter I-Fan, Ross, Curtis, Siegele, Deborah A, Hu, James C
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8022724/
https://www.ncbi.nlm.nih.gov/pubmed/33561236
http://dx.doi.org/10.1093/g3journal/jkaa035
_version_ 1783674991941779456
author Wu, Peter I-Fan
Ross, Curtis
Siegele, Deborah A
Hu, James C
author_facet Wu, Peter I-Fan
Ross, Curtis
Siegele, Deborah A
Hu, James C
author_sort Wu, Peter I-Fan
collection PubMed
description Despite the demonstrated success of genome-wide genetic screens and chemical genomics studies at predicting functions for genes of unknown function or predicting new functions for well-characterized genes, their potential to provide insights into gene function has not been fully explored. We systematically reanalyzed a published high-throughput phenotypic dataset for the model Gram-negative bacterium Escherichia coli K-12. The availability of high-quality annotation sets allowed us to compare the power of different metrics for measuring phenotypic profile similarity to correctly infer gene function. We conclude that there is no single best method; the three metrics tested gave comparable results for most gene pairs. We also assessed how converting quantitative phenotypes to discrete, qualitative phenotypes affected the association between phenotype and function. Our results indicate that this approach may allow phenotypic data from different studies to be combined to produce a larger dataset that may reveal functional connections between genes not detected in individual studies.
format Online
Article
Text
id pubmed-8022724
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-80227242021-04-09 Insights from the reanalysis of high-throughput chemical genomics data for Escherichia coli K-12 Wu, Peter I-Fan Ross, Curtis Siegele, Deborah A Hu, James C G3 (Bethesda) Investigation Despite the demonstrated success of genome-wide genetic screens and chemical genomics studies at predicting functions for genes of unknown function or predicting new functions for well-characterized genes, their potential to provide insights into gene function has not been fully explored. We systematically reanalyzed a published high-throughput phenotypic dataset for the model Gram-negative bacterium Escherichia coli K-12. The availability of high-quality annotation sets allowed us to compare the power of different metrics for measuring phenotypic profile similarity to correctly infer gene function. We conclude that there is no single best method; the three metrics tested gave comparable results for most gene pairs. We also assessed how converting quantitative phenotypes to discrete, qualitative phenotypes affected the association between phenotype and function. Our results indicate that this approach may allow phenotypic data from different studies to be combined to produce a larger dataset that may reveal functional connections between genes not detected in individual studies. Oxford University Press 2020-12-22 /pmc/articles/PMC8022724/ /pubmed/33561236 http://dx.doi.org/10.1093/g3journal/jkaa035 Text en © The Author(s) 2020. Published by Oxford University Press on behalf of Genetics Society of America. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Investigation
Wu, Peter I-Fan
Ross, Curtis
Siegele, Deborah A
Hu, James C
Insights from the reanalysis of high-throughput chemical genomics data for Escherichia coli K-12
title Insights from the reanalysis of high-throughput chemical genomics data for Escherichia coli K-12
title_full Insights from the reanalysis of high-throughput chemical genomics data for Escherichia coli K-12
title_fullStr Insights from the reanalysis of high-throughput chemical genomics data for Escherichia coli K-12
title_full_unstemmed Insights from the reanalysis of high-throughput chemical genomics data for Escherichia coli K-12
title_short Insights from the reanalysis of high-throughput chemical genomics data for Escherichia coli K-12
title_sort insights from the reanalysis of high-throughput chemical genomics data for escherichia coli k-12
topic Investigation
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8022724/
https://www.ncbi.nlm.nih.gov/pubmed/33561236
http://dx.doi.org/10.1093/g3journal/jkaa035
work_keys_str_mv AT wupeterifan insightsfromthereanalysisofhighthroughputchemicalgenomicsdataforescherichiacolik12
AT rosscurtis insightsfromthereanalysisofhighthroughputchemicalgenomicsdataforescherichiacolik12
AT siegeledeboraha insightsfromthereanalysisofhighthroughputchemicalgenomicsdataforescherichiacolik12
AT hujamesc insightsfromthereanalysisofhighthroughputchemicalgenomicsdataforescherichiacolik12