Cargando…

Strategy for improved characterization of human metabolic phenotypes using a COmbined Multi-block Principal components Analysis with Statistical Spectroscopy (COMPASS)

MOTIVATION: Large-scale population omics data can provide insight into associations between gene–environment interactions and disease. However, existing dimension reduction modelling techniques are often inefficient for extracting detailed information from these complex datasets. RESULTS: Here, we p...

Descripción completa

Detalles Bibliográficos
Autores principales: Loo, Ruey Leng, Chan, Queenie, Antti, Henrik, Li, Jia V, Ashrafian, H, Elliott, Paul, Stamler, Jeremiah, Nicholson, Jeremy K, Holmes, Elaine, Wist, Julien
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7850059/
https://www.ncbi.nlm.nih.gov/pubmed/32692809
http://dx.doi.org/10.1093/bioinformatics/btaa649
_version_ 1783645399734550528
author Loo, Ruey Leng
Chan, Queenie
Antti, Henrik
Li, Jia V
Ashrafian, H
Elliott, Paul
Stamler, Jeremiah
Nicholson, Jeremy K
Holmes, Elaine
Wist, Julien
author_facet Loo, Ruey Leng
Chan, Queenie
Antti, Henrik
Li, Jia V
Ashrafian, H
Elliott, Paul
Stamler, Jeremiah
Nicholson, Jeremy K
Holmes, Elaine
Wist, Julien
author_sort Loo, Ruey Leng
collection PubMed
description MOTIVATION: Large-scale population omics data can provide insight into associations between gene–environment interactions and disease. However, existing dimension reduction modelling techniques are often inefficient for extracting detailed information from these complex datasets. RESULTS: Here, we present an interactive software pipeline for exploratory analyses of population-based nuclear magnetic resonance spectral data using a COmbined Multi-block Principal components Analysis with Statistical Spectroscopy (COMPASS) within the R-library hastaLaVista framework. Principal component analysis models are generated for a sequential series of spectral regions (blocks) to provide more granular detail defining sub-populations within the dataset. Molecular identification of key differentiating signals is subsequently achieved by implementing Statistical TOtal Correlation SpectroscopY on the full spectral data to define feature patterns. Finally, the distributions of cross-correlation of the reference patterns across the spectral dataset are used to provide population statistics for identifying underlying features arising from drug intake, latent diseases and diet. The COMPASS method thus provides an efficient semi-automated approach for screening population datasets. AVAILABILITY AND IMPLEMENTATION: Source code is available at https://github.com/cheminfo/COMPASS. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-7850059
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-78500592021-02-03 Strategy for improved characterization of human metabolic phenotypes using a COmbined Multi-block Principal components Analysis with Statistical Spectroscopy (COMPASS) Loo, Ruey Leng Chan, Queenie Antti, Henrik Li, Jia V Ashrafian, H Elliott, Paul Stamler, Jeremiah Nicholson, Jeremy K Holmes, Elaine Wist, Julien Bioinformatics Original Papers MOTIVATION: Large-scale population omics data can provide insight into associations between gene–environment interactions and disease. However, existing dimension reduction modelling techniques are often inefficient for extracting detailed information from these complex datasets. RESULTS: Here, we present an interactive software pipeline for exploratory analyses of population-based nuclear magnetic resonance spectral data using a COmbined Multi-block Principal components Analysis with Statistical Spectroscopy (COMPASS) within the R-library hastaLaVista framework. Principal component analysis models are generated for a sequential series of spectral regions (blocks) to provide more granular detail defining sub-populations within the dataset. Molecular identification of key differentiating signals is subsequently achieved by implementing Statistical TOtal Correlation SpectroscopY on the full spectral data to define feature patterns. Finally, the distributions of cross-correlation of the reference patterns across the spectral dataset are used to provide population statistics for identifying underlying features arising from drug intake, latent diseases and diet. The COMPASS method thus provides an efficient semi-automated approach for screening population datasets. AVAILABILITY AND IMPLEMENTATION: Source code is available at https://github.com/cheminfo/COMPASS. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2020-07-21 /pmc/articles/PMC7850059/ /pubmed/32692809 http://dx.doi.org/10.1093/bioinformatics/btaa649 Text en © The Author(s) 2020. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Loo, Ruey Leng
Chan, Queenie
Antti, Henrik
Li, Jia V
Ashrafian, H
Elliott, Paul
Stamler, Jeremiah
Nicholson, Jeremy K
Holmes, Elaine
Wist, Julien
Strategy for improved characterization of human metabolic phenotypes using a COmbined Multi-block Principal components Analysis with Statistical Spectroscopy (COMPASS)
title Strategy for improved characterization of human metabolic phenotypes using a COmbined Multi-block Principal components Analysis with Statistical Spectroscopy (COMPASS)
title_full Strategy for improved characterization of human metabolic phenotypes using a COmbined Multi-block Principal components Analysis with Statistical Spectroscopy (COMPASS)
title_fullStr Strategy for improved characterization of human metabolic phenotypes using a COmbined Multi-block Principal components Analysis with Statistical Spectroscopy (COMPASS)
title_full_unstemmed Strategy for improved characterization of human metabolic phenotypes using a COmbined Multi-block Principal components Analysis with Statistical Spectroscopy (COMPASS)
title_short Strategy for improved characterization of human metabolic phenotypes using a COmbined Multi-block Principal components Analysis with Statistical Spectroscopy (COMPASS)
title_sort strategy for improved characterization of human metabolic phenotypes using a combined multi-block principal components analysis with statistical spectroscopy (compass)
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7850059/
https://www.ncbi.nlm.nih.gov/pubmed/32692809
http://dx.doi.org/10.1093/bioinformatics/btaa649
work_keys_str_mv AT loorueyleng strategyforimprovedcharacterizationofhumanmetabolicphenotypesusingacombinedmultiblockprincipalcomponentsanalysiswithstatisticalspectroscopycompass
AT chanqueenie strategyforimprovedcharacterizationofhumanmetabolicphenotypesusingacombinedmultiblockprincipalcomponentsanalysiswithstatisticalspectroscopycompass
AT anttihenrik strategyforimprovedcharacterizationofhumanmetabolicphenotypesusingacombinedmultiblockprincipalcomponentsanalysiswithstatisticalspectroscopycompass
AT lijiav strategyforimprovedcharacterizationofhumanmetabolicphenotypesusingacombinedmultiblockprincipalcomponentsanalysiswithstatisticalspectroscopycompass
AT ashrafianh strategyforimprovedcharacterizationofhumanmetabolicphenotypesusingacombinedmultiblockprincipalcomponentsanalysiswithstatisticalspectroscopycompass
AT elliottpaul strategyforimprovedcharacterizationofhumanmetabolicphenotypesusingacombinedmultiblockprincipalcomponentsanalysiswithstatisticalspectroscopycompass
AT stamlerjeremiah strategyforimprovedcharacterizationofhumanmetabolicphenotypesusingacombinedmultiblockprincipalcomponentsanalysiswithstatisticalspectroscopycompass
AT nicholsonjeremyk strategyforimprovedcharacterizationofhumanmetabolicphenotypesusingacombinedmultiblockprincipalcomponentsanalysiswithstatisticalspectroscopycompass
AT holmeselaine strategyforimprovedcharacterizationofhumanmetabolicphenotypesusingacombinedmultiblockprincipalcomponentsanalysiswithstatisticalspectroscopycompass
AT wistjulien strategyforimprovedcharacterizationofhumanmetabolicphenotypesusingacombinedmultiblockprincipalcomponentsanalysiswithstatisticalspectroscopycompass