Cargando…

Maximal Extraction of Biological Information from Genetic Interaction Data

Extraction of all the biological information inherent in large-scale genetic interaction datasets remains a significant challenge for systems biology. The core problem is essentially that of classification of the relationships among phenotypes of mutant strains into biologically informative “rules”...

Descripción completa

Detalles Bibliográficos
Autores principales: Carter, Gregory W., Galas, David J., Galitski, Timothy
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2659753/
https://www.ncbi.nlm.nih.gov/pubmed/19343223
http://dx.doi.org/10.1371/journal.pcbi.1000347
_version_ 1782165691220099072
author Carter, Gregory W.
Galas, David J.
Galitski, Timothy
author_facet Carter, Gregory W.
Galas, David J.
Galitski, Timothy
author_sort Carter, Gregory W.
collection PubMed
description Extraction of all the biological information inherent in large-scale genetic interaction datasets remains a significant challenge for systems biology. The core problem is essentially that of classification of the relationships among phenotypes of mutant strains into biologically informative “rules” of gene interaction. Geneticists have determined such classifications based on insights from biological examples, but it is not clear that there is a systematic, unsupervised way to extract this information. In this paper we describe such a method that depends on maximizing a previously described context-dependent information measure to obtain maximally informative biological networks. We have successfully validated this method on two examples from yeast by demonstrating that more biological information is obtained when analysis is guided by this information measure. The context-dependent information measure is a function only of phenotype data and a set of interaction rules, involving no prior biological knowledge. Analysis of the resulting networks reveals that the most biologically informative networks are those with the greatest context-dependent information scores. We propose that these high-complexity networks reveal genetic architecture at a modular level, in contrast to classical genetic interaction rules that order genes in pathways. We suggest that our analysis represents a powerful, data-driven, and general approach to genetic interaction analysis, with particular potential in the study of mammalian systems in which interactions are complex and gene annotation data are sparse.
format Text
id pubmed-2659753
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-26597532009-04-03 Maximal Extraction of Biological Information from Genetic Interaction Data Carter, Gregory W. Galas, David J. Galitski, Timothy PLoS Comput Biol Research Article Extraction of all the biological information inherent in large-scale genetic interaction datasets remains a significant challenge for systems biology. The core problem is essentially that of classification of the relationships among phenotypes of mutant strains into biologically informative “rules” of gene interaction. Geneticists have determined such classifications based on insights from biological examples, but it is not clear that there is a systematic, unsupervised way to extract this information. In this paper we describe such a method that depends on maximizing a previously described context-dependent information measure to obtain maximally informative biological networks. We have successfully validated this method on two examples from yeast by demonstrating that more biological information is obtained when analysis is guided by this information measure. The context-dependent information measure is a function only of phenotype data and a set of interaction rules, involving no prior biological knowledge. Analysis of the resulting networks reveals that the most biologically informative networks are those with the greatest context-dependent information scores. We propose that these high-complexity networks reveal genetic architecture at a modular level, in contrast to classical genetic interaction rules that order genes in pathways. We suggest that our analysis represents a powerful, data-driven, and general approach to genetic interaction analysis, with particular potential in the study of mammalian systems in which interactions are complex and gene annotation data are sparse. Public Library of Science 2009-04-03 /pmc/articles/PMC2659753/ /pubmed/19343223 http://dx.doi.org/10.1371/journal.pcbi.1000347 Text en Carter et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Carter, Gregory W.
Galas, David J.
Galitski, Timothy
Maximal Extraction of Biological Information from Genetic Interaction Data
title Maximal Extraction of Biological Information from Genetic Interaction Data
title_full Maximal Extraction of Biological Information from Genetic Interaction Data
title_fullStr Maximal Extraction of Biological Information from Genetic Interaction Data
title_full_unstemmed Maximal Extraction of Biological Information from Genetic Interaction Data
title_short Maximal Extraction of Biological Information from Genetic Interaction Data
title_sort maximal extraction of biological information from genetic interaction data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2659753/
https://www.ncbi.nlm.nih.gov/pubmed/19343223
http://dx.doi.org/10.1371/journal.pcbi.1000347
work_keys_str_mv AT cartergregoryw maximalextractionofbiologicalinformationfromgeneticinteractiondata
AT galasdavidj maximalextractionofbiologicalinformationfromgeneticinteractiondata
AT galitskitimothy maximalextractionofbiologicalinformationfromgeneticinteractiondata