Cargando…

From chemoproteomic‐detected amino acids to genomic coordinates: insights into precise multi‐omic data integration

The integration of proteomic, transcriptomic, and genetic variant annotation data will improve our understanding of genotype–phenotype associations. Due, in part, to challenges associated with accurate inter‐database mapping, such multi‐omic studies have not extended to chemoproteomics, a method tha...

Descripción completa

Detalles Bibliográficos
Autores principales: Palafox, Maria F, Desai, Heta S, Arboleda, Valerie A, Backus, Keriann M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7890448/
https://www.ncbi.nlm.nih.gov/pubmed/33599394
http://dx.doi.org/10.15252/msb.20209840
_version_ 1783652516103192576
author Palafox, Maria F
Desai, Heta S
Arboleda, Valerie A
Backus, Keriann M
author_facet Palafox, Maria F
Desai, Heta S
Arboleda, Valerie A
Backus, Keriann M
author_sort Palafox, Maria F
collection PubMed
description The integration of proteomic, transcriptomic, and genetic variant annotation data will improve our understanding of genotype–phenotype associations. Due, in part, to challenges associated with accurate inter‐database mapping, such multi‐omic studies have not extended to chemoproteomics, a method that measures the intrinsic reactivity and potential “druggability” of nucleophilic amino acid side chains. Here, we evaluated mapping approaches to match chemoproteomic‐detected cysteine and lysine residues with their genetic coordinates. Our analysis revealed that database update cycles and reliance on stable identifiers can lead to pervasive misidentification of labeled residues. Enabled by this examination of mapping strategies, we then integrated our chemoproteomics data with computational methods for predicting genetic variant pathogenicity, which revealed that codons of highly reactive cysteines are enriched for genetic variants that are predicted to be more deleterious and allowed us to identify and functionally characterize a new damaging residue in the cysteine protease caspase‐8. Our study provides a roadmap for more precise inter‐database mapping and points to untapped opportunities to improve the predictive power of pathogenicity scores and to advance prioritization of putative druggable sites.
format Online
Article
Text
id pubmed-7890448
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-78904482021-03-02 From chemoproteomic‐detected amino acids to genomic coordinates: insights into precise multi‐omic data integration Palafox, Maria F Desai, Heta S Arboleda, Valerie A Backus, Keriann M Mol Syst Biol Articles The integration of proteomic, transcriptomic, and genetic variant annotation data will improve our understanding of genotype–phenotype associations. Due, in part, to challenges associated with accurate inter‐database mapping, such multi‐omic studies have not extended to chemoproteomics, a method that measures the intrinsic reactivity and potential “druggability” of nucleophilic amino acid side chains. Here, we evaluated mapping approaches to match chemoproteomic‐detected cysteine and lysine residues with their genetic coordinates. Our analysis revealed that database update cycles and reliance on stable identifiers can lead to pervasive misidentification of labeled residues. Enabled by this examination of mapping strategies, we then integrated our chemoproteomics data with computational methods for predicting genetic variant pathogenicity, which revealed that codons of highly reactive cysteines are enriched for genetic variants that are predicted to be more deleterious and allowed us to identify and functionally characterize a new damaging residue in the cysteine protease caspase‐8. Our study provides a roadmap for more precise inter‐database mapping and points to untapped opportunities to improve the predictive power of pathogenicity scores and to advance prioritization of putative druggable sites. John Wiley and Sons Inc. 2021-02-18 /pmc/articles/PMC7890448/ /pubmed/33599394 http://dx.doi.org/10.15252/msb.20209840 Text en © 2021 The Authors. Published under the terms of the CC BY 4.0 license This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Articles
Palafox, Maria F
Desai, Heta S
Arboleda, Valerie A
Backus, Keriann M
From chemoproteomic‐detected amino acids to genomic coordinates: insights into precise multi‐omic data integration
title From chemoproteomic‐detected amino acids to genomic coordinates: insights into precise multi‐omic data integration
title_full From chemoproteomic‐detected amino acids to genomic coordinates: insights into precise multi‐omic data integration
title_fullStr From chemoproteomic‐detected amino acids to genomic coordinates: insights into precise multi‐omic data integration
title_full_unstemmed From chemoproteomic‐detected amino acids to genomic coordinates: insights into precise multi‐omic data integration
title_short From chemoproteomic‐detected amino acids to genomic coordinates: insights into precise multi‐omic data integration
title_sort from chemoproteomic‐detected amino acids to genomic coordinates: insights into precise multi‐omic data integration
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7890448/
https://www.ncbi.nlm.nih.gov/pubmed/33599394
http://dx.doi.org/10.15252/msb.20209840
work_keys_str_mv AT palafoxmariaf fromchemoproteomicdetectedaminoacidstogenomiccoordinatesinsightsintoprecisemultiomicdataintegration
AT desaihetas fromchemoproteomicdetectedaminoacidstogenomiccoordinatesinsightsintoprecisemultiomicdataintegration
AT arboledavaleriea fromchemoproteomicdetectedaminoacidstogenomiccoordinatesinsightsintoprecisemultiomicdataintegration
AT backuskeriannm fromchemoproteomicdetectedaminoacidstogenomiccoordinatesinsightsintoprecisemultiomicdataintegration