Cargando…

Genecentric: a package to uncover graph-theoretic structure in high-throughput epistasis data

BACKGROUND: New technology has resulted in high-throughput screens for pairwise genetic interactions in yeast and other model organisms. For each pair in a collection of non-essential genes, an epistasis score is obtained, representing how much sicker (or healthier) the double-knockout organism will...

Descripción completa

Detalles Bibliográficos
Autores principales: Gallant, Andrew, Leiserson, Mark DM, Kachalov, Maxim, Cowen, Lenore J, Hescott, Benjamin J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3614884/
https://www.ncbi.nlm.nih.gov/pubmed/23331614
http://dx.doi.org/10.1186/1471-2105-14-23
_version_ 1782264937696985088
author Gallant, Andrew
Leiserson, Mark DM
Kachalov, Maxim
Cowen, Lenore J
Hescott, Benjamin J
author_facet Gallant, Andrew
Leiserson, Mark DM
Kachalov, Maxim
Cowen, Lenore J
Hescott, Benjamin J
author_sort Gallant, Andrew
collection PubMed
description BACKGROUND: New technology has resulted in high-throughput screens for pairwise genetic interactions in yeast and other model organisms. For each pair in a collection of non-essential genes, an epistasis score is obtained, representing how much sicker (or healthier) the double-knockout organism will be compared to what would be expected from the sickness of the component single knockouts. Recent algorithmic work has identified graph-theoretic patterns in this data that can indicate functional modules, and even sets of genes that may occur in compensatory pathways, such as a BPM-type schema first introduced by Kelley and Ideker. However, to date, any algorithms for finding such patterns in the data were implemented internally, with no software being made publically available. RESULTS: Genecentric is a new package that implements a parallelized version of the Leiserson et al. algorithm (J Comput Biol 18:1399-1409, 2011) for generating generalized BPMs from high-throughput genetic interaction data. Given a matrix of weighted epistasis values for a set of double knock-outs, Genecentric returns a list of generalized BPMs that may represent compensatory pathways. Genecentric also has an extension, GenecentricGO, to query FuncAssociate (Bioinformatics 25:3043-3044, 2009) to retrieve GO enrichment statistics on generated BPMs. Python is the only dependency, and our web site provides working examples and documentation. CONCLUSION: We find that Genecentric can be used to find coherent functional and perhaps compensatory gene sets from high throughput genetic interaction data. Genecentric is made freely available for download under the GPLv2 from http://bcb.cs.tufts.edu/genecentric.
format Online
Article
Text
id pubmed-3614884
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-36148842013-04-03 Genecentric: a package to uncover graph-theoretic structure in high-throughput epistasis data Gallant, Andrew Leiserson, Mark DM Kachalov, Maxim Cowen, Lenore J Hescott, Benjamin J BMC Bioinformatics Software BACKGROUND: New technology has resulted in high-throughput screens for pairwise genetic interactions in yeast and other model organisms. For each pair in a collection of non-essential genes, an epistasis score is obtained, representing how much sicker (or healthier) the double-knockout organism will be compared to what would be expected from the sickness of the component single knockouts. Recent algorithmic work has identified graph-theoretic patterns in this data that can indicate functional modules, and even sets of genes that may occur in compensatory pathways, such as a BPM-type schema first introduced by Kelley and Ideker. However, to date, any algorithms for finding such patterns in the data were implemented internally, with no software being made publically available. RESULTS: Genecentric is a new package that implements a parallelized version of the Leiserson et al. algorithm (J Comput Biol 18:1399-1409, 2011) for generating generalized BPMs from high-throughput genetic interaction data. Given a matrix of weighted epistasis values for a set of double knock-outs, Genecentric returns a list of generalized BPMs that may represent compensatory pathways. Genecentric also has an extension, GenecentricGO, to query FuncAssociate (Bioinformatics 25:3043-3044, 2009) to retrieve GO enrichment statistics on generated BPMs. Python is the only dependency, and our web site provides working examples and documentation. CONCLUSION: We find that Genecentric can be used to find coherent functional and perhaps compensatory gene sets from high throughput genetic interaction data. Genecentric is made freely available for download under the GPLv2 from http://bcb.cs.tufts.edu/genecentric. BioMed Central 2013-01-18 /pmc/articles/PMC3614884/ /pubmed/23331614 http://dx.doi.org/10.1186/1471-2105-14-23 Text en Copyright © 2013 Gallant et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Gallant, Andrew
Leiserson, Mark DM
Kachalov, Maxim
Cowen, Lenore J
Hescott, Benjamin J
Genecentric: a package to uncover graph-theoretic structure in high-throughput epistasis data
title Genecentric: a package to uncover graph-theoretic structure in high-throughput epistasis data
title_full Genecentric: a package to uncover graph-theoretic structure in high-throughput epistasis data
title_fullStr Genecentric: a package to uncover graph-theoretic structure in high-throughput epistasis data
title_full_unstemmed Genecentric: a package to uncover graph-theoretic structure in high-throughput epistasis data
title_short Genecentric: a package to uncover graph-theoretic structure in high-throughput epistasis data
title_sort genecentric: a package to uncover graph-theoretic structure in high-throughput epistasis data
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3614884/
https://www.ncbi.nlm.nih.gov/pubmed/23331614
http://dx.doi.org/10.1186/1471-2105-14-23
work_keys_str_mv AT gallantandrew genecentricapackagetouncovergraphtheoreticstructureinhighthroughputepistasisdata
AT leisersonmarkdm genecentricapackagetouncovergraphtheoreticstructureinhighthroughputepistasisdata
AT kachalovmaxim genecentricapackagetouncovergraphtheoreticstructureinhighthroughputepistasisdata
AT cowenlenorej genecentricapackagetouncovergraphtheoreticstructureinhighthroughputepistasisdata
AT hescottbenjaminj genecentricapackagetouncovergraphtheoreticstructureinhighthroughputepistasisdata