
The quality of metabolic pathway resources depends on initial enzymatic function assignments: a case for maize


Bibliographic Details
Main Authors: Walsh, Jesse R., Schaeffer, Mary L., Zhang, Peifen, Rhee, Seung Y., Dickerson, Julie A., Sen, Taner Z.
Format: Online Article Text
Language: English
Published: BioMed Central 2016
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5129634/
https://www.ncbi.nlm.nih.gov/pubmed/27899149
http://dx.doi.org/10.1186/s12918-016-0369-x
author Walsh, Jesse R.
Schaeffer, Mary L.
Zhang, Peifen
Rhee, Seung Y.
Dickerson, Julie A.
Sen, Taner Z.
author_sort Walsh, Jesse R.
collection PubMed
description BACKGROUND: As metabolic pathway resources become more commonly available, researchers have unprecedented access to information about their organism of interest. Despite efforts to ensure consistency between various resources, information content and quality can vary widely. Two maize metabolic pathway resources for the B73 inbred line, CornCyc 4.0 and MaizeCyc 2.2, are based on the same gene model set and were developed using Pathway Tools software. These resources differ in their initial enzymatic function assignments and in the extent of manual curation. We present an in-depth comparison between CornCyc and MaizeCyc to demonstrate the effect of initial computational enzymatic function assignments on the quality and content of metabolic pathway resources. RESULTS: These two resources are different in their content. MaizeCyc contains GO annotations for over 21,000 genes that CornCyc is missing. CornCyc contains on average 1.6 transcripts per gene, while MaizeCyc contains almost no alternate splicing. MaizeCyc also does not match CornCyc’s breadth in representing the metabolic domain; MaizeCyc has fewer compounds, reactions, and pathways than CornCyc. CornCyc’s computational predictions are more accurate than those in MaizeCyc when compared to experimentally determined function assignments, demonstrating the relative strength of the enzymatic function assignment pipeline used to generate CornCyc. CONCLUSIONS: Our results show that the quality of initial enzymatic function assignments primarily determines the quality of the final metabolic pathway resource. Therefore, biologists should pay close attention to the methods and information sources used to develop a metabolic pathway resource to gauge the utility of using such functional assignments to construct hypotheses for experimental studies. 
ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12918-016-0369-x) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5129634
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-5129634 2016-12-12 The quality of metabolic pathway resources depends on initial enzymatic function assignments: a case for maize Walsh, Jesse R. Schaeffer, Mary L. Zhang, Peifen Rhee, Seung Y. Dickerson, Julie A. Sen, Taner Z. BMC Syst Biol Research Article
BioMed Central 2016-11-29 /pmc/articles/PMC5129634/ /pubmed/27899149 http://dx.doi.org/10.1186/s12918-016-0369-x Text en
© The Author(s) 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
title The quality of metabolic pathway resources depends on initial enzymatic function assignments: a case for maize
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5129634/
https://www.ncbi.nlm.nih.gov/pubmed/27899149
http://dx.doi.org/10.1186/s12918-016-0369-x