Cargando…

The use of missing values in proteomic data-independent acquisition mass spectrometry to enable disease activity discrimination

MOTIVATION: Data-independent acquisition mass spectrometry allows for comprehensive peptide detection and relative quantification than standard data-dependent approaches. While less prone to missing values, these still exist. Current approaches for handling the so-called missingness have challenges....

Descripción completa

Detalles Bibliográficos
Autores principales: McGurk, Kathryn A, Dagliati, Arianna, Chiasserini, Davide, Lee, Dave, Plant, Darren, Baricevic-Jones, Ivona, Kelsall, Janet, Eineman, Rachael, Reed, Rachel, Geary, Bethany, Unwin, Richard D, Nicolaou, Anna, Keavney, Bernard D, Barton, Anne, Whetton, Anthony D, Geifman, Nophar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7141869/
https://www.ncbi.nlm.nih.gov/pubmed/31790148
http://dx.doi.org/10.1093/bioinformatics/btz898
Descripción
Sumario:MOTIVATION: Data-independent acquisition mass spectrometry allows for comprehensive peptide detection and relative quantification than standard data-dependent approaches. While less prone to missing values, these still exist. Current approaches for handling the so-called missingness have challenges. We hypothesized that non-random missingness is a useful biological measure and demonstrate the importance of analysing missingness for proteomic discovery within a longitudinal study of disease activity. RESULTS: The magnitude of missingness did not correlate with mean peptide concentration. The magnitude of missingness for each protein strongly correlated between collection time points (baseline, 3 months, 6 months; R = 0.95–0.97, confidence interval = 0.94–0.97) indicating little time-dependent effect. This allowed for the identification of proteins with outlier levels of missingness that differentiate between the patient groups characterized by different patterns of disease activity. The association of these proteins with disease activity was confirmed by machine learning techniques. Our novel approach complements analyses on complete observations and other missing value strategies in biomarker prediction of disease activity. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.