Cargando…
Natural Product Discovery Using Planes of Principal Component Analysis in R (PoPCAR)
Rediscovery of known natural products hinders the discovery of new, unique scaffolds. Efforts have mostly focused on streamlining the determination of what compounds are known vs. unknown (dereplication), but an alternative strategy is to focus on what is different. Utilizing statistics and assuming...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5618319/ https://www.ncbi.nlm.nih.gov/pubmed/28703778 http://dx.doi.org/10.3390/metabo7030034 |
_version_ | 1783267159642734592 |
---|---|
author | Chanana, Shaurya Thomas, Chris S. Braun, Doug R. Hou, Yanpeng Wyche, Thomas P. Bugni, Tim S. |
author_facet | Chanana, Shaurya Thomas, Chris S. Braun, Doug R. Hou, Yanpeng Wyche, Thomas P. Bugni, Tim S. |
author_sort | Chanana, Shaurya |
collection | PubMed |
description | Rediscovery of known natural products hinders the discovery of new, unique scaffolds. Efforts have mostly focused on streamlining the determination of what compounds are known vs. unknown (dereplication), but an alternative strategy is to focus on what is different. Utilizing statistics and assuming that common actinobacterial metabolites are likely known, focus can be shifted away from dereplication and towards discovery. LC-MS-based principal component analysis (PCA) provides a perfect tool to distinguish unique vs. common metabolites, but the variability inherent within natural products leads to datasets that do not fit ideal standards. To simplify the analysis of PCA models, we developed a script that identifies only those masses or molecules that are unique to each strain within a group, thereby greatly reducing the number of data points to be inspected manually. Since the script is written in R, it facilitates integration with other metabolomics workflows and supports automated mass matching to databases such as Antibase. |
format | Online Article Text |
id | pubmed-5618319 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-56183192017-09-29 Natural Product Discovery Using Planes of Principal Component Analysis in R (PoPCAR) Chanana, Shaurya Thomas, Chris S. Braun, Doug R. Hou, Yanpeng Wyche, Thomas P. Bugni, Tim S. Metabolites Article Rediscovery of known natural products hinders the discovery of new, unique scaffolds. Efforts have mostly focused on streamlining the determination of what compounds are known vs. unknown (dereplication), but an alternative strategy is to focus on what is different. Utilizing statistics and assuming that common actinobacterial metabolites are likely known, focus can be shifted away from dereplication and towards discovery. LC-MS-based principal component analysis (PCA) provides a perfect tool to distinguish unique vs. common metabolites, but the variability inherent within natural products leads to datasets that do not fit ideal standards. To simplify the analysis of PCA models, we developed a script that identifies only those masses or molecules that are unique to each strain within a group, thereby greatly reducing the number of data points to be inspected manually. Since the script is written in R, it facilitates integration with other metabolomics workflows and supports automated mass matching to databases such as Antibase. MDPI 2017-07-13 /pmc/articles/PMC5618319/ /pubmed/28703778 http://dx.doi.org/10.3390/metabo7030034 Text en © 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Chanana, Shaurya Thomas, Chris S. Braun, Doug R. Hou, Yanpeng Wyche, Thomas P. Bugni, Tim S. Natural Product Discovery Using Planes of Principal Component Analysis in R (PoPCAR) |
title | Natural Product Discovery Using Planes of Principal Component Analysis in R (PoPCAR) |
title_full | Natural Product Discovery Using Planes of Principal Component Analysis in R (PoPCAR) |
title_fullStr | Natural Product Discovery Using Planes of Principal Component Analysis in R (PoPCAR) |
title_full_unstemmed | Natural Product Discovery Using Planes of Principal Component Analysis in R (PoPCAR) |
title_short | Natural Product Discovery Using Planes of Principal Component Analysis in R (PoPCAR) |
title_sort | natural product discovery using planes of principal component analysis in r (popcar) |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5618319/ https://www.ncbi.nlm.nih.gov/pubmed/28703778 http://dx.doi.org/10.3390/metabo7030034 |
work_keys_str_mv | AT chananashaurya naturalproductdiscoveryusingplanesofprincipalcomponentanalysisinrpopcar AT thomaschriss naturalproductdiscoveryusingplanesofprincipalcomponentanalysisinrpopcar AT braundougr naturalproductdiscoveryusingplanesofprincipalcomponentanalysisinrpopcar AT houyanpeng naturalproductdiscoveryusingplanesofprincipalcomponentanalysisinrpopcar AT wychethomasp naturalproductdiscoveryusingplanesofprincipalcomponentanalysisinrpopcar AT bugnitims naturalproductdiscoveryusingplanesofprincipalcomponentanalysisinrpopcar |