Cargando…

Natural Product Discovery Using Planes of Principal Component Analysis in R (PoPCAR)

Rediscovery of known natural products hinders the discovery of new, unique scaffolds. Efforts have mostly focused on streamlining the determination of what compounds are known vs. unknown (dereplication), but an alternative strategy is to focus on what is different. Utilizing statistics and assuming...

Descripción completa

Detalles Bibliográficos
Autores principales: Chanana, Shaurya, Thomas, Chris S., Braun, Doug R., Hou, Yanpeng, Wyche, Thomas P., Bugni, Tim S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5618319/
https://www.ncbi.nlm.nih.gov/pubmed/28703778
http://dx.doi.org/10.3390/metabo7030034
_version_ 1783267159642734592
author Chanana, Shaurya
Thomas, Chris S.
Braun, Doug R.
Hou, Yanpeng
Wyche, Thomas P.
Bugni, Tim S.
author_facet Chanana, Shaurya
Thomas, Chris S.
Braun, Doug R.
Hou, Yanpeng
Wyche, Thomas P.
Bugni, Tim S.
author_sort Chanana, Shaurya
collection PubMed
description Rediscovery of known natural products hinders the discovery of new, unique scaffolds. Efforts have mostly focused on streamlining the determination of what compounds are known vs. unknown (dereplication), but an alternative strategy is to focus on what is different. Utilizing statistics and assuming that common actinobacterial metabolites are likely known, focus can be shifted away from dereplication and towards discovery. LC-MS-based principal component analysis (PCA) provides a perfect tool to distinguish unique vs. common metabolites, but the variability inherent within natural products leads to datasets that do not fit ideal standards. To simplify the analysis of PCA models, we developed a script that identifies only those masses or molecules that are unique to each strain within a group, thereby greatly reducing the number of data points to be inspected manually. Since the script is written in R, it facilitates integration with other metabolomics workflows and supports automated mass matching to databases such as Antibase.
format Online
Article
Text
id pubmed-5618319
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-56183192017-09-29 Natural Product Discovery Using Planes of Principal Component Analysis in R (PoPCAR) Chanana, Shaurya Thomas, Chris S. Braun, Doug R. Hou, Yanpeng Wyche, Thomas P. Bugni, Tim S. Metabolites Article Rediscovery of known natural products hinders the discovery of new, unique scaffolds. Efforts have mostly focused on streamlining the determination of what compounds are known vs. unknown (dereplication), but an alternative strategy is to focus on what is different. Utilizing statistics and assuming that common actinobacterial metabolites are likely known, focus can be shifted away from dereplication and towards discovery. LC-MS-based principal component analysis (PCA) provides a perfect tool to distinguish unique vs. common metabolites, but the variability inherent within natural products leads to datasets that do not fit ideal standards. To simplify the analysis of PCA models, we developed a script that identifies only those masses or molecules that are unique to each strain within a group, thereby greatly reducing the number of data points to be inspected manually. Since the script is written in R, it facilitates integration with other metabolomics workflows and supports automated mass matching to databases such as Antibase. MDPI 2017-07-13 /pmc/articles/PMC5618319/ /pubmed/28703778 http://dx.doi.org/10.3390/metabo7030034 Text en © 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Chanana, Shaurya
Thomas, Chris S.
Braun, Doug R.
Hou, Yanpeng
Wyche, Thomas P.
Bugni, Tim S.
Natural Product Discovery Using Planes of Principal Component Analysis in R (PoPCAR)
title Natural Product Discovery Using Planes of Principal Component Analysis in R (PoPCAR)
title_full Natural Product Discovery Using Planes of Principal Component Analysis in R (PoPCAR)
title_fullStr Natural Product Discovery Using Planes of Principal Component Analysis in R (PoPCAR)
title_full_unstemmed Natural Product Discovery Using Planes of Principal Component Analysis in R (PoPCAR)
title_short Natural Product Discovery Using Planes of Principal Component Analysis in R (PoPCAR)
title_sort natural product discovery using planes of principal component analysis in r (popcar)
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5618319/
https://www.ncbi.nlm.nih.gov/pubmed/28703778
http://dx.doi.org/10.3390/metabo7030034
work_keys_str_mv AT chananashaurya naturalproductdiscoveryusingplanesofprincipalcomponentanalysisinrpopcar
AT thomaschriss naturalproductdiscoveryusingplanesofprincipalcomponentanalysisinrpopcar
AT braundougr naturalproductdiscoveryusingplanesofprincipalcomponentanalysisinrpopcar
AT houyanpeng naturalproductdiscoveryusingplanesofprincipalcomponentanalysisinrpopcar
AT wychethomasp naturalproductdiscoveryusingplanesofprincipalcomponentanalysisinrpopcar
AT bugnitims naturalproductdiscoveryusingplanesofprincipalcomponentanalysisinrpopcar