Cargando…
Strategy for improved characterization of human metabolic phenotypes using a COmbined Multi-block Principal components Analysis with Statistical Spectroscopy (COMPASS)
MOTIVATION: Large-scale population omics data can provide insight into associations between gene–environment interactions and disease. However, existing dimension reduction modelling techniques are often inefficient for extracting detailed information from these complex datasets. RESULTS: Here, we p...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7850059/ https://www.ncbi.nlm.nih.gov/pubmed/32692809 http://dx.doi.org/10.1093/bioinformatics/btaa649 |
_version_ | 1783645399734550528 |
---|---|
author | Loo, Ruey Leng Chan, Queenie Antti, Henrik Li, Jia V Ashrafian, H Elliott, Paul Stamler, Jeremiah Nicholson, Jeremy K Holmes, Elaine Wist, Julien |
author_facet | Loo, Ruey Leng Chan, Queenie Antti, Henrik Li, Jia V Ashrafian, H Elliott, Paul Stamler, Jeremiah Nicholson, Jeremy K Holmes, Elaine Wist, Julien |
author_sort | Loo, Ruey Leng |
collection | PubMed |
description | MOTIVATION: Large-scale population omics data can provide insight into associations between gene–environment interactions and disease. However, existing dimension reduction modelling techniques are often inefficient for extracting detailed information from these complex datasets. RESULTS: Here, we present an interactive software pipeline for exploratory analyses of population-based nuclear magnetic resonance spectral data using a COmbined Multi-block Principal components Analysis with Statistical Spectroscopy (COMPASS) within the R-library hastaLaVista framework. Principal component analysis models are generated for a sequential series of spectral regions (blocks) to provide more granular detail defining sub-populations within the dataset. Molecular identification of key differentiating signals is subsequently achieved by implementing Statistical TOtal Correlation SpectroscopY on the full spectral data to define feature patterns. Finally, the distributions of cross-correlation of the reference patterns across the spectral dataset are used to provide population statistics for identifying underlying features arising from drug intake, latent diseases and diet. The COMPASS method thus provides an efficient semi-automated approach for screening population datasets. AVAILABILITY AND IMPLEMENTATION: Source code is available at https://github.com/cheminfo/COMPASS. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-7850059 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-78500592021-02-03 Strategy for improved characterization of human metabolic phenotypes using a COmbined Multi-block Principal components Analysis with Statistical Spectroscopy (COMPASS) Loo, Ruey Leng Chan, Queenie Antti, Henrik Li, Jia V Ashrafian, H Elliott, Paul Stamler, Jeremiah Nicholson, Jeremy K Holmes, Elaine Wist, Julien Bioinformatics Original Papers MOTIVATION: Large-scale population omics data can provide insight into associations between gene–environment interactions and disease. However, existing dimension reduction modelling techniques are often inefficient for extracting detailed information from these complex datasets. RESULTS: Here, we present an interactive software pipeline for exploratory analyses of population-based nuclear magnetic resonance spectral data using a COmbined Multi-block Principal components Analysis with Statistical Spectroscopy (COMPASS) within the R-library hastaLaVista framework. Principal component analysis models are generated for a sequential series of spectral regions (blocks) to provide more granular detail defining sub-populations within the dataset. Molecular identification of key differentiating signals is subsequently achieved by implementing Statistical TOtal Correlation SpectroscopY on the full spectral data to define feature patterns. Finally, the distributions of cross-correlation of the reference patterns across the spectral dataset are used to provide population statistics for identifying underlying features arising from drug intake, latent diseases and diet. The COMPASS method thus provides an efficient semi-automated approach for screening population datasets. AVAILABILITY AND IMPLEMENTATION: Source code is available at https://github.com/cheminfo/COMPASS. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2020-07-21 /pmc/articles/PMC7850059/ /pubmed/32692809 http://dx.doi.org/10.1093/bioinformatics/btaa649 Text en © The Author(s) 2020. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Papers Loo, Ruey Leng Chan, Queenie Antti, Henrik Li, Jia V Ashrafian, H Elliott, Paul Stamler, Jeremiah Nicholson, Jeremy K Holmes, Elaine Wist, Julien Strategy for improved characterization of human metabolic phenotypes using a COmbined Multi-block Principal components Analysis with Statistical Spectroscopy (COMPASS) |
title | Strategy for improved characterization of human metabolic phenotypes using a COmbined Multi-block Principal components Analysis with Statistical Spectroscopy (COMPASS) |
title_full | Strategy for improved characterization of human metabolic phenotypes using a COmbined Multi-block Principal components Analysis with Statistical Spectroscopy (COMPASS) |
title_fullStr | Strategy for improved characterization of human metabolic phenotypes using a COmbined Multi-block Principal components Analysis with Statistical Spectroscopy (COMPASS) |
title_full_unstemmed | Strategy for improved characterization of human metabolic phenotypes using a COmbined Multi-block Principal components Analysis with Statistical Spectroscopy (COMPASS) |
title_short | Strategy for improved characterization of human metabolic phenotypes using a COmbined Multi-block Principal components Analysis with Statistical Spectroscopy (COMPASS) |
title_sort | strategy for improved characterization of human metabolic phenotypes using a combined multi-block principal components analysis with statistical spectroscopy (compass) |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7850059/ https://www.ncbi.nlm.nih.gov/pubmed/32692809 http://dx.doi.org/10.1093/bioinformatics/btaa649 |
work_keys_str_mv | AT loorueyleng strategyforimprovedcharacterizationofhumanmetabolicphenotypesusingacombinedmultiblockprincipalcomponentsanalysiswithstatisticalspectroscopycompass AT chanqueenie strategyforimprovedcharacterizationofhumanmetabolicphenotypesusingacombinedmultiblockprincipalcomponentsanalysiswithstatisticalspectroscopycompass AT anttihenrik strategyforimprovedcharacterizationofhumanmetabolicphenotypesusingacombinedmultiblockprincipalcomponentsanalysiswithstatisticalspectroscopycompass AT lijiav strategyforimprovedcharacterizationofhumanmetabolicphenotypesusingacombinedmultiblockprincipalcomponentsanalysiswithstatisticalspectroscopycompass AT ashrafianh strategyforimprovedcharacterizationofhumanmetabolicphenotypesusingacombinedmultiblockprincipalcomponentsanalysiswithstatisticalspectroscopycompass AT elliottpaul strategyforimprovedcharacterizationofhumanmetabolicphenotypesusingacombinedmultiblockprincipalcomponentsanalysiswithstatisticalspectroscopycompass AT stamlerjeremiah strategyforimprovedcharacterizationofhumanmetabolicphenotypesusingacombinedmultiblockprincipalcomponentsanalysiswithstatisticalspectroscopycompass AT nicholsonjeremyk strategyforimprovedcharacterizationofhumanmetabolicphenotypesusingacombinedmultiblockprincipalcomponentsanalysiswithstatisticalspectroscopycompass AT holmeselaine strategyforimprovedcharacterizationofhumanmetabolicphenotypesusingacombinedmultiblockprincipalcomponentsanalysiswithstatisticalspectroscopycompass AT wistjulien strategyforimprovedcharacterizationofhumanmetabolicphenotypesusingacombinedmultiblockprincipalcomponentsanalysiswithstatisticalspectroscopycompass |