Cargando…

Identifier mapping performance for integrating transcriptomics and proteomics experimental results

BACKGROUND: Studies integrating transcriptomic data with proteomic data can illuminate the proteome more clearly than either separately. Integromic studies can deepen understanding of the dynamic complex regulatory relationship between the transcriptome and the proteome. Integrating these data dicta...

Descripción completa

Detalles Bibliográficos
Autores principales: Day, Roger S, McDade, Kevin K, Chandran, Uma R, Lisovich, Alex, Conrads, Thomas P, Hood, Brian L, Kolli, VS Kumar, Kirchner, David, Litzi, Traci, Maxwell, G Larry
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3124437/
https://www.ncbi.nlm.nih.gov/pubmed/21619611
http://dx.doi.org/10.1186/1471-2105-12-213
_version_ 1782207088691249152
author Day, Roger S
McDade, Kevin K
Chandran, Uma R
Lisovich, Alex
Conrads, Thomas P
Hood, Brian L
Kolli, VS Kumar
Kirchner, David
Litzi, Traci
Maxwell, G Larry
author_facet Day, Roger S
McDade, Kevin K
Chandran, Uma R
Lisovich, Alex
Conrads, Thomas P
Hood, Brian L
Kolli, VS Kumar
Kirchner, David
Litzi, Traci
Maxwell, G Larry
author_sort Day, Roger S
collection PubMed
description BACKGROUND: Studies integrating transcriptomic data with proteomic data can illuminate the proteome more clearly than either separately. Integromic studies can deepen understanding of the dynamic complex regulatory relationship between the transcriptome and the proteome. Integrating these data dictates a reliable mapping between the identifier nomenclature resultant from the two high-throughput platforms. However, this kind of analysis is well known to be hampered by lack of standardization of identifier nomenclature among proteins, genes, and microarray probe sets. Therefore data integration may also play a role in critiquing the fallible gene identifications that both platforms emit. RESULTS: We compared three freely available internet-based identifier mapping resources for mapping UniProt accessions (ACCs) to Affymetrix probesets identifications (IDs): DAVID, EnVision, and NetAffx. Liquid chromatography-tandem mass spectrometry analyses of 91 endometrial cancer and 7 noncancer samples generated 11,879 distinct ACCs. For each ACC, we compared the retrieval sets of probeset IDs from each mapping resource. We confirmed a high level of discrepancy among the mapping resources. On the same samples, mRNA expression was available. Therefore, to evaluate the quality of each ACC-to-probeset match, we calculated proteome-transcriptome correlations, and compared the resources presuming that better mapping of identifiers should generate a higher proportion of mapped pairs with strong inter-platform correlations. A mixture model for the correlations fitted well and supported regression analysis, providing a window into the performance of the mapping resources. The resources have added and dropped matches over two years, but their overall performance has not changed. CONCLUSIONS: The methods presented here serve to achieve concrete context-specific insight, to support well-informed decisions in choosing an ID mapping strategy for "omic" data merging.
format Online
Article
Text
id pubmed-3124437
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31244372011-06-28 Identifier mapping performance for integrating transcriptomics and proteomics experimental results Day, Roger S McDade, Kevin K Chandran, Uma R Lisovich, Alex Conrads, Thomas P Hood, Brian L Kolli, VS Kumar Kirchner, David Litzi, Traci Maxwell, G Larry BMC Bioinformatics Methodology Article BACKGROUND: Studies integrating transcriptomic data with proteomic data can illuminate the proteome more clearly than either separately. Integromic studies can deepen understanding of the dynamic complex regulatory relationship between the transcriptome and the proteome. Integrating these data dictates a reliable mapping between the identifier nomenclature resultant from the two high-throughput platforms. However, this kind of analysis is well known to be hampered by lack of standardization of identifier nomenclature among proteins, genes, and microarray probe sets. Therefore data integration may also play a role in critiquing the fallible gene identifications that both platforms emit. RESULTS: We compared three freely available internet-based identifier mapping resources for mapping UniProt accessions (ACCs) to Affymetrix probesets identifications (IDs): DAVID, EnVision, and NetAffx. Liquid chromatography-tandem mass spectrometry analyses of 91 endometrial cancer and 7 noncancer samples generated 11,879 distinct ACCs. For each ACC, we compared the retrieval sets of probeset IDs from each mapping resource. We confirmed a high level of discrepancy among the mapping resources. On the same samples, mRNA expression was available. Therefore, to evaluate the quality of each ACC-to-probeset match, we calculated proteome-transcriptome correlations, and compared the resources presuming that better mapping of identifiers should generate a higher proportion of mapped pairs with strong inter-platform correlations. A mixture model for the correlations fitted well and supported regression analysis, providing a window into the performance of the mapping resources. The resources have added and dropped matches over two years, but their overall performance has not changed. CONCLUSIONS: The methods presented here serve to achieve concrete context-specific insight, to support well-informed decisions in choosing an ID mapping strategy for "omic" data merging. BioMed Central 2011-05-27 /pmc/articles/PMC3124437/ /pubmed/21619611 http://dx.doi.org/10.1186/1471-2105-12-213 Text en Copyright ©2011 Day et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Day, Roger S
McDade, Kevin K
Chandran, Uma R
Lisovich, Alex
Conrads, Thomas P
Hood, Brian L
Kolli, VS Kumar
Kirchner, David
Litzi, Traci
Maxwell, G Larry
Identifier mapping performance for integrating transcriptomics and proteomics experimental results
title Identifier mapping performance for integrating transcriptomics and proteomics experimental results
title_full Identifier mapping performance for integrating transcriptomics and proteomics experimental results
title_fullStr Identifier mapping performance for integrating transcriptomics and proteomics experimental results
title_full_unstemmed Identifier mapping performance for integrating transcriptomics and proteomics experimental results
title_short Identifier mapping performance for integrating transcriptomics and proteomics experimental results
title_sort identifier mapping performance for integrating transcriptomics and proteomics experimental results
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3124437/
https://www.ncbi.nlm.nih.gov/pubmed/21619611
http://dx.doi.org/10.1186/1471-2105-12-213
work_keys_str_mv AT dayrogers identifiermappingperformanceforintegratingtranscriptomicsandproteomicsexperimentalresults
AT mcdadekevink identifiermappingperformanceforintegratingtranscriptomicsandproteomicsexperimentalresults
AT chandranumar identifiermappingperformanceforintegratingtranscriptomicsandproteomicsexperimentalresults
AT lisovichalex identifiermappingperformanceforintegratingtranscriptomicsandproteomicsexperimentalresults
AT conradsthomasp identifiermappingperformanceforintegratingtranscriptomicsandproteomicsexperimentalresults
AT hoodbrianl identifiermappingperformanceforintegratingtranscriptomicsandproteomicsexperimentalresults
AT kollivskumar identifiermappingperformanceforintegratingtranscriptomicsandproteomicsexperimentalresults
AT kirchnerdavid identifiermappingperformanceforintegratingtranscriptomicsandproteomicsexperimentalresults
AT litzitraci identifiermappingperformanceforintegratingtranscriptomicsandproteomicsexperimentalresults
AT maxwellglarry identifiermappingperformanceforintegratingtranscriptomicsandproteomicsexperimentalresults