Cargando…

GO Trimming: Systematically reducing redundancy in large Gene Ontology datasets

BACKGROUND: The increased accessibility of gene expression tools has enabled a wide variety of experiments utilizing transcriptomic analyses. As these tools increase in prevalence, the need for improved standardization in processing and presentation of data increases, as does the need to guard again...

Descripción completa

Detalles Bibliográficos
Autores principales:	Jantzen, Stuart G, Sutherland, Ben JG, Minkley, David R, Koop, Ben F
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2011
Materias:	Technical Note
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3160396/ https://www.ncbi.nlm.nih.gov/pubmed/21798041 http://dx.doi.org/10.1186/1756-0500-4-267

_version_	1782210552322326528
author	Jantzen, Stuart G Sutherland, Ben JG Minkley, David R Koop, Ben F
author_facet	Jantzen, Stuart G Sutherland, Ben JG Minkley, David R Koop, Ben F
author_sort	Jantzen, Stuart G
collection	PubMed
description	BACKGROUND: The increased accessibility of gene expression tools has enabled a wide variety of experiments utilizing transcriptomic analyses. As these tools increase in prevalence, the need for improved standardization in processing and presentation of data increases, as does the need to guard against interpretation bias. Gene Ontology (GO) analysis is a powerful method of interpreting and summarizing biological functions. However, while there are many tools available to investigate GO enrichment, there remains a need for methods that directly remove redundant terms from enriched GO lists that often provide little, if any, additional information. FINDINGS: Here we present a simple yet novel method called GO Trimming that utilizes an algorithm designed to reduce redundancy in lists of enriched GO categories. Depending on the needs of the user, this method can be performed with variable stringency. In the example presented here, an initial list of 90 terms was reduced to 54, eliminating 36 largely redundant terms. We also compare this method to existing methods and find that GO Trimming, while simple, performs well to eliminate redundant terms in a large dataset throughout the depth of the GO hierarchy. CONCLUSIONS: The GO Trimming method provides an alternative to other procedures, some of which involve removing large numbers of terms prior to enrichment analysis. This method should free up the researcher from analyzing overly large, redundant lists, and instead enable the concise presentation of manageable, informative GO lists. The implementation of this tool is freely available at: http://lucy.ceh.uvic.ca/go_trimming/cbr_go_trimming.py
format	Online Article Text
id	pubmed-3160396
institution	National Center for Biotechnology Information
language	English
publishDate	2011
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-31603962011-08-24 GO Trimming: Systematically reducing redundancy in large Gene Ontology datasets Jantzen, Stuart G Sutherland, Ben JG Minkley, David R Koop, Ben F BMC Res Notes Technical Note BACKGROUND: The increased accessibility of gene expression tools has enabled a wide variety of experiments utilizing transcriptomic analyses. As these tools increase in prevalence, the need for improved standardization in processing and presentation of data increases, as does the need to guard against interpretation bias. Gene Ontology (GO) analysis is a powerful method of interpreting and summarizing biological functions. However, while there are many tools available to investigate GO enrichment, there remains a need for methods that directly remove redundant terms from enriched GO lists that often provide little, if any, additional information. FINDINGS: Here we present a simple yet novel method called GO Trimming that utilizes an algorithm designed to reduce redundancy in lists of enriched GO categories. Depending on the needs of the user, this method can be performed with variable stringency. In the example presented here, an initial list of 90 terms was reduced to 54, eliminating 36 largely redundant terms. We also compare this method to existing methods and find that GO Trimming, while simple, performs well to eliminate redundant terms in a large dataset throughout the depth of the GO hierarchy. CONCLUSIONS: The GO Trimming method provides an alternative to other procedures, some of which involve removing large numbers of terms prior to enrichment analysis. This method should free up the researcher from analyzing overly large, redundant lists, and instead enable the concise presentation of manageable, informative GO lists. The implementation of this tool is freely available at: http://lucy.ceh.uvic.ca/go_trimming/cbr_go_trimming.py BioMed Central 2011-07-28 /pmc/articles/PMC3160396/ /pubmed/21798041 http://dx.doi.org/10.1186/1756-0500-4-267 Text en Copyright ©2011 Koop et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Technical Note Jantzen, Stuart G Sutherland, Ben JG Minkley, David R Koop, Ben F GO Trimming: Systematically reducing redundancy in large Gene Ontology datasets
title	GO Trimming: Systematically reducing redundancy in large Gene Ontology datasets
title_full	GO Trimming: Systematically reducing redundancy in large Gene Ontology datasets
title_fullStr	GO Trimming: Systematically reducing redundancy in large Gene Ontology datasets
title_full_unstemmed	GO Trimming: Systematically reducing redundancy in large Gene Ontology datasets
title_short	GO Trimming: Systematically reducing redundancy in large Gene Ontology datasets
title_sort	go trimming: systematically reducing redundancy in large gene ontology datasets
topic	Technical Note
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3160396/ https://www.ncbi.nlm.nih.gov/pubmed/21798041 http://dx.doi.org/10.1186/1756-0500-4-267
work_keys_str_mv	AT jantzenstuartg gotrimmingsystematicallyreducingredundancyinlargegeneontologydatasets AT sutherlandbenjg gotrimmingsystematicallyreducingredundancyinlargegeneontologydatasets AT minkleydavidr gotrimmingsystematicallyreducingredundancyinlargegeneontologydatasets AT koopbenf gotrimmingsystematicallyreducingredundancyinlargegeneontologydatasets

GO Trimming: Systematically reducing redundancy in large Gene Ontology datasets

Ejemplares similares