Cargando…

Assessment of Metabolome Annotation Quality: A Method for Evaluating the False Discovery Rate of Elemental Composition Searches

BACKGROUND: In metabolomics researches using mass spectrometry (MS), systematic searching of high-resolution mass data against compound databases is often the first step of metabolite annotation to determine elemental compositions possessing similar theoretical mass numbers. However, incorrect hits...

Descripción completa

Detalles Bibliográficos
Autores principales: Matsuda, Fumio, Shinbo, Yoko, Oikawa, Akira, Hirai, Masami Yokota, Fiehn, Oliver, Kanaya, Shigehiko, Saito, Kazuki
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2761541/
https://www.ncbi.nlm.nih.gov/pubmed/19847304
http://dx.doi.org/10.1371/journal.pone.0007490
_version_ 1782172840321089536
author Matsuda, Fumio
Shinbo, Yoko
Oikawa, Akira
Hirai, Masami Yokota
Fiehn, Oliver
Kanaya, Shigehiko
Saito, Kazuki
author_facet Matsuda, Fumio
Shinbo, Yoko
Oikawa, Akira
Hirai, Masami Yokota
Fiehn, Oliver
Kanaya, Shigehiko
Saito, Kazuki
author_sort Matsuda, Fumio
collection PubMed
description BACKGROUND: In metabolomics researches using mass spectrometry (MS), systematic searching of high-resolution mass data against compound databases is often the first step of metabolite annotation to determine elemental compositions possessing similar theoretical mass numbers. However, incorrect hits derived from errors in mass analyses will be included in the results of elemental composition searches. To assess the quality of peak annotation information, a novel methodology for false discovery rates (FDR) evaluation is presented in this study. Based on the FDR analyses, several aspects of an elemental composition search, including setting a threshold, estimating FDR, and the types of elemental composition databases most reliable for searching are discussed. METHODOLOGY/PRINCIPAL FINDINGS: The FDR can be determined from one measured value (i.e., the hit rate for search queries) and four parameters determined by Monte Carlo simulation. The results indicate that relatively high FDR values (30–50%) were obtained when searching time-of-flight (TOF)/MS data using the KNApSAcK and KEGG databases. In addition, searches against large all-in-one databases (e.g., PubChem) always produced unacceptable results (FDR >70%). The estimated FDRs suggest that the quality of search results can be improved not only by performing more accurate mass analysis but also by modifying the properties of the compound database. A theoretical analysis indicates that FDR could be improved by using compound database with smaller but higher completeness entries. CONCLUSIONS/SIGNIFICANCE: High accuracy mass analysis, such as Fourier transform (FT)-MS, is needed for reliable annotation (FDR <10%). In addition, a small, customized compound database is preferable for high-quality annotation of metabolome data.
format Text
id pubmed-2761541
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-27615412009-10-22 Assessment of Metabolome Annotation Quality: A Method for Evaluating the False Discovery Rate of Elemental Composition Searches Matsuda, Fumio Shinbo, Yoko Oikawa, Akira Hirai, Masami Yokota Fiehn, Oliver Kanaya, Shigehiko Saito, Kazuki PLoS One Research Article BACKGROUND: In metabolomics researches using mass spectrometry (MS), systematic searching of high-resolution mass data against compound databases is often the first step of metabolite annotation to determine elemental compositions possessing similar theoretical mass numbers. However, incorrect hits derived from errors in mass analyses will be included in the results of elemental composition searches. To assess the quality of peak annotation information, a novel methodology for false discovery rates (FDR) evaluation is presented in this study. Based on the FDR analyses, several aspects of an elemental composition search, including setting a threshold, estimating FDR, and the types of elemental composition databases most reliable for searching are discussed. METHODOLOGY/PRINCIPAL FINDINGS: The FDR can be determined from one measured value (i.e., the hit rate for search queries) and four parameters determined by Monte Carlo simulation. The results indicate that relatively high FDR values (30–50%) were obtained when searching time-of-flight (TOF)/MS data using the KNApSAcK and KEGG databases. In addition, searches against large all-in-one databases (e.g., PubChem) always produced unacceptable results (FDR >70%). The estimated FDRs suggest that the quality of search results can be improved not only by performing more accurate mass analysis but also by modifying the properties of the compound database. A theoretical analysis indicates that FDR could be improved by using compound database with smaller but higher completeness entries. CONCLUSIONS/SIGNIFICANCE: High accuracy mass analysis, such as Fourier transform (FT)-MS, is needed for reliable annotation (FDR <10%). In addition, a small, customized compound database is preferable for high-quality annotation of metabolome data. Public Library of Science 2009-10-16 /pmc/articles/PMC2761541/ /pubmed/19847304 http://dx.doi.org/10.1371/journal.pone.0007490 Text en Matsuda et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Matsuda, Fumio
Shinbo, Yoko
Oikawa, Akira
Hirai, Masami Yokota
Fiehn, Oliver
Kanaya, Shigehiko
Saito, Kazuki
Assessment of Metabolome Annotation Quality: A Method for Evaluating the False Discovery Rate of Elemental Composition Searches
title Assessment of Metabolome Annotation Quality: A Method for Evaluating the False Discovery Rate of Elemental Composition Searches
title_full Assessment of Metabolome Annotation Quality: A Method for Evaluating the False Discovery Rate of Elemental Composition Searches
title_fullStr Assessment of Metabolome Annotation Quality: A Method for Evaluating the False Discovery Rate of Elemental Composition Searches
title_full_unstemmed Assessment of Metabolome Annotation Quality: A Method for Evaluating the False Discovery Rate of Elemental Composition Searches
title_short Assessment of Metabolome Annotation Quality: A Method for Evaluating the False Discovery Rate of Elemental Composition Searches
title_sort assessment of metabolome annotation quality: a method for evaluating the false discovery rate of elemental composition searches
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2761541/
https://www.ncbi.nlm.nih.gov/pubmed/19847304
http://dx.doi.org/10.1371/journal.pone.0007490
work_keys_str_mv AT matsudafumio assessmentofmetabolomeannotationqualityamethodforevaluatingthefalsediscoveryrateofelementalcompositionsearches
AT shinboyoko assessmentofmetabolomeannotationqualityamethodforevaluatingthefalsediscoveryrateofelementalcompositionsearches
AT oikawaakira assessmentofmetabolomeannotationqualityamethodforevaluatingthefalsediscoveryrateofelementalcompositionsearches
AT hiraimasamiyokota assessmentofmetabolomeannotationqualityamethodforevaluatingthefalsediscoveryrateofelementalcompositionsearches
AT fiehnoliver assessmentofmetabolomeannotationqualityamethodforevaluatingthefalsediscoveryrateofelementalcompositionsearches
AT kanayashigehiko assessmentofmetabolomeannotationqualityamethodforevaluatingthefalsediscoveryrateofelementalcompositionsearches
AT saitokazuki assessmentofmetabolomeannotationqualityamethodforevaluatingthefalsediscoveryrateofelementalcompositionsearches