Cargando…

Inferring gene ontologies from pairwise similarity data

Motivation: 1. analyze a full matrix of gene–gene pairwise similarities from -omics data; 2. infer true hierarchical structure in these data rather than enforcing hierarchy as a computational artifact; and 3. respect biological pleiotropy, by which a term in the hierarchy can relate to multiple high...

Descripción completa

Detalles Bibliográficos
Autores principales: Kramer, Michael, Dutkowski, Janusz, Yu, Michael, Bafna, Vineet, Ideker, Trey
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4058954/
https://www.ncbi.nlm.nih.gov/pubmed/24932003
http://dx.doi.org/10.1093/bioinformatics/btu282
_version_ 1782321193326477312
author Kramer, Michael
Dutkowski, Janusz
Yu, Michael
Bafna, Vineet
Ideker, Trey
author_facet Kramer, Michael
Dutkowski, Janusz
Yu, Michael
Bafna, Vineet
Ideker, Trey
author_sort Kramer, Michael
collection PubMed
description Motivation: 1. analyze a full matrix of gene–gene pairwise similarities from -omics data; 2. infer true hierarchical structure in these data rather than enforcing hierarchy as a computational artifact; and 3. respect biological pleiotropy, by which a term in the hierarchy can relate to multiple higher level terms. Methods addressing these requirements are just beginning to emerge—none has been evaluated for GO inference. Methods: We consider two algorithms [Clique Extracted Ontology (CliXO), LocalFitness] that uniquely satisfy these requirements, compared with methods including standard clustering. CliXO is a new approach that finds maximal cliques in a network induced by progressive thresholding of a similarity matrix. We evaluate each method’s ability to reconstruct the GO biological process ontology from a similarity matrix based on (a) semantic similarities for GO itself or (b) three -omics datasets for yeast. Results: For task (a) using semantic similarity, CliXO accurately reconstructs GO (>99% precision, recall) and outperforms other approaches (<20% precision, <20% recall). For task (b) using -omics data, CliXO outperforms other methods using two -omics datasets and achieves ∼30% precision and recall using YeastNet v3, similar to an earlier approach (Network Extracted Ontology) and better than LocalFitness or standard clustering (20–25% precision, recall). Conclusion: This study provides algorithmic foundation for building gene ontologies by capturing hierarchical and pleiotropic structure embedded in biomolecular data. Contact: tideker@ucsd.edu
format Online
Article
Text
id pubmed-4058954
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-40589542014-06-18 Inferring gene ontologies from pairwise similarity data Kramer, Michael Dutkowski, Janusz Yu, Michael Bafna, Vineet Ideker, Trey Bioinformatics Ismb 2014 Proceedings Papers Committee Motivation: 1. analyze a full matrix of gene–gene pairwise similarities from -omics data; 2. infer true hierarchical structure in these data rather than enforcing hierarchy as a computational artifact; and 3. respect biological pleiotropy, by which a term in the hierarchy can relate to multiple higher level terms. Methods addressing these requirements are just beginning to emerge—none has been evaluated for GO inference. Methods: We consider two algorithms [Clique Extracted Ontology (CliXO), LocalFitness] that uniquely satisfy these requirements, compared with methods including standard clustering. CliXO is a new approach that finds maximal cliques in a network induced by progressive thresholding of a similarity matrix. We evaluate each method’s ability to reconstruct the GO biological process ontology from a similarity matrix based on (a) semantic similarities for GO itself or (b) three -omics datasets for yeast. Results: For task (a) using semantic similarity, CliXO accurately reconstructs GO (>99% precision, recall) and outperforms other approaches (<20% precision, <20% recall). For task (b) using -omics data, CliXO outperforms other methods using two -omics datasets and achieves ∼30% precision and recall using YeastNet v3, similar to an earlier approach (Network Extracted Ontology) and better than LocalFitness or standard clustering (20–25% precision, recall). Conclusion: This study provides algorithmic foundation for building gene ontologies by capturing hierarchical and pleiotropic structure embedded in biomolecular data. Contact: tideker@ucsd.edu Oxford University Press 2014-06-15 2014-06-11 /pmc/articles/PMC4058954/ /pubmed/24932003 http://dx.doi.org/10.1093/bioinformatics/btu282 Text en © The Author 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Ismb 2014 Proceedings Papers Committee
Kramer, Michael
Dutkowski, Janusz
Yu, Michael
Bafna, Vineet
Ideker, Trey
Inferring gene ontologies from pairwise similarity data
title Inferring gene ontologies from pairwise similarity data
title_full Inferring gene ontologies from pairwise similarity data
title_fullStr Inferring gene ontologies from pairwise similarity data
title_full_unstemmed Inferring gene ontologies from pairwise similarity data
title_short Inferring gene ontologies from pairwise similarity data
title_sort inferring gene ontologies from pairwise similarity data
topic Ismb 2014 Proceedings Papers Committee
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4058954/
https://www.ncbi.nlm.nih.gov/pubmed/24932003
http://dx.doi.org/10.1093/bioinformatics/btu282
work_keys_str_mv AT kramermichael inferringgeneontologiesfrompairwisesimilaritydata
AT dutkowskijanusz inferringgeneontologiesfrompairwisesimilaritydata
AT yumichael inferringgeneontologiesfrompairwisesimilaritydata
AT bafnavineet inferringgeneontologiesfrompairwisesimilaritydata
AT idekertrey inferringgeneontologiesfrompairwisesimilaritydata