Cargando…
Inferring gene ontologies from pairwise similarity data
Motivation: 1. analyze a full matrix of gene–gene pairwise similarities from -omics data; 2. infer true hierarchical structure in these data rather than enforcing hierarchy as a computational artifact; and 3. respect biological pleiotropy, by which a term in the hierarchy can relate to multiple high...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4058954/ https://www.ncbi.nlm.nih.gov/pubmed/24932003 http://dx.doi.org/10.1093/bioinformatics/btu282 |
_version_ | 1782321193326477312 |
---|---|
author | Kramer, Michael Dutkowski, Janusz Yu, Michael Bafna, Vineet Ideker, Trey |
author_facet | Kramer, Michael Dutkowski, Janusz Yu, Michael Bafna, Vineet Ideker, Trey |
author_sort | Kramer, Michael |
collection | PubMed |
description | Motivation: 1. analyze a full matrix of gene–gene pairwise similarities from -omics data; 2. infer true hierarchical structure in these data rather than enforcing hierarchy as a computational artifact; and 3. respect biological pleiotropy, by which a term in the hierarchy can relate to multiple higher level terms. Methods addressing these requirements are just beginning to emerge—none has been evaluated for GO inference. Methods: We consider two algorithms [Clique Extracted Ontology (CliXO), LocalFitness] that uniquely satisfy these requirements, compared with methods including standard clustering. CliXO is a new approach that finds maximal cliques in a network induced by progressive thresholding of a similarity matrix. We evaluate each method’s ability to reconstruct the GO biological process ontology from a similarity matrix based on (a) semantic similarities for GO itself or (b) three -omics datasets for yeast. Results: For task (a) using semantic similarity, CliXO accurately reconstructs GO (>99% precision, recall) and outperforms other approaches (<20% precision, <20% recall). For task (b) using -omics data, CliXO outperforms other methods using two -omics datasets and achieves ∼30% precision and recall using YeastNet v3, similar to an earlier approach (Network Extracted Ontology) and better than LocalFitness or standard clustering (20–25% precision, recall). Conclusion: This study provides algorithmic foundation for building gene ontologies by capturing hierarchical and pleiotropic structure embedded in biomolecular data. Contact: tideker@ucsd.edu |
format | Online Article Text |
id | pubmed-4058954 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-40589542014-06-18 Inferring gene ontologies from pairwise similarity data Kramer, Michael Dutkowski, Janusz Yu, Michael Bafna, Vineet Ideker, Trey Bioinformatics Ismb 2014 Proceedings Papers Committee Motivation: 1. analyze a full matrix of gene–gene pairwise similarities from -omics data; 2. infer true hierarchical structure in these data rather than enforcing hierarchy as a computational artifact; and 3. respect biological pleiotropy, by which a term in the hierarchy can relate to multiple higher level terms. Methods addressing these requirements are just beginning to emerge—none has been evaluated for GO inference. Methods: We consider two algorithms [Clique Extracted Ontology (CliXO), LocalFitness] that uniquely satisfy these requirements, compared with methods including standard clustering. CliXO is a new approach that finds maximal cliques in a network induced by progressive thresholding of a similarity matrix. We evaluate each method’s ability to reconstruct the GO biological process ontology from a similarity matrix based on (a) semantic similarities for GO itself or (b) three -omics datasets for yeast. Results: For task (a) using semantic similarity, CliXO accurately reconstructs GO (>99% precision, recall) and outperforms other approaches (<20% precision, <20% recall). For task (b) using -omics data, CliXO outperforms other methods using two -omics datasets and achieves ∼30% precision and recall using YeastNet v3, similar to an earlier approach (Network Extracted Ontology) and better than LocalFitness or standard clustering (20–25% precision, recall). Conclusion: This study provides algorithmic foundation for building gene ontologies by capturing hierarchical and pleiotropic structure embedded in biomolecular data. Contact: tideker@ucsd.edu Oxford University Press 2014-06-15 2014-06-11 /pmc/articles/PMC4058954/ /pubmed/24932003 http://dx.doi.org/10.1093/bioinformatics/btu282 Text en © The Author 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Ismb 2014 Proceedings Papers Committee Kramer, Michael Dutkowski, Janusz Yu, Michael Bafna, Vineet Ideker, Trey Inferring gene ontologies from pairwise similarity data |
title | Inferring gene ontologies from pairwise similarity data |
title_full | Inferring gene ontologies from pairwise similarity data |
title_fullStr | Inferring gene ontologies from pairwise similarity data |
title_full_unstemmed | Inferring gene ontologies from pairwise similarity data |
title_short | Inferring gene ontologies from pairwise similarity data |
title_sort | inferring gene ontologies from pairwise similarity data |
topic | Ismb 2014 Proceedings Papers Committee |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4058954/ https://www.ncbi.nlm.nih.gov/pubmed/24932003 http://dx.doi.org/10.1093/bioinformatics/btu282 |
work_keys_str_mv | AT kramermichael inferringgeneontologiesfrompairwisesimilaritydata AT dutkowskijanusz inferringgeneontologiesfrompairwisesimilaritydata AT yumichael inferringgeneontologiesfrompairwisesimilaritydata AT bafnavineet inferringgeneontologiesfrompairwisesimilaritydata AT idekertrey inferringgeneontologiesfrompairwisesimilaritydata |