Cargando…
Identification of biomarkers that distinguish chemical contaminants based on gene expression profiles
BACKGROUND: High throughput transcriptomics profiles such as those generated using microarrays have been useful in identifying biomarkers for different classification and toxicity prediction purposes. Here, we investigated the use of microarrays to predict chemical toxicants and their possible mecha...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4051169/ https://www.ncbi.nlm.nih.gov/pubmed/24678894 http://dx.doi.org/10.1186/1471-2164-15-248 |
_version_ | 1782320071286194176 |
---|---|
author | Wei, Xiaomou Ai, Junmei Deng, Youping Guan, Xin Johnson, David R Ang, Choo Y Zhang, Chaoyang Perkins, Edward J |
author_facet | Wei, Xiaomou Ai, Junmei Deng, Youping Guan, Xin Johnson, David R Ang, Choo Y Zhang, Chaoyang Perkins, Edward J |
author_sort | Wei, Xiaomou |
collection | PubMed |
description | BACKGROUND: High throughput transcriptomics profiles such as those generated using microarrays have been useful in identifying biomarkers for different classification and toxicity prediction purposes. Here, we investigated the use of microarrays to predict chemical toxicants and their possible mechanisms of action. RESULTS: In this study, in vitro cultures of primary rat hepatocytes were exposed to 105 chemicals and vehicle controls, representing 14 compound classes. We comprehensively compared various normalization of gene expression profiles, feature selection and classification algorithms for the classification of these 105 chemicals into14 compound classes. We found that normalization had little effect on the averaged classification accuracy. Two support vector machine (SVM) methods, LibSVM and sequential minimal optimization, had better classification performance than other methods. SVM recursive feature selection (SVM-RFE) had the highest overfitting rate when an independent dataset was used for a prediction. Therefore, we developed a new feature selection algorithm called gradient method that had a relatively high training classification as well as prediction accuracy with the lowest overfitting rate of the methods tested. Analysis of biomarkers that distinguished the 14 classes of compounds identified a group of genes principally involved in cell cycle function that were significantly downregulated by metal and inflammatory compounds, but were induced by anti-microbial, cancer related drugs, pesticides, and PXR mediators. CONCLUSIONS: Our results indicate that using microarrays and a supervised machine learning approach to predict chemical toxicants, their potential toxicity and mechanisms of action is practical and efficient. Choosing the right feature and classification algorithms for this multiple category classification and prediction is critical. |
format | Online Article Text |
id | pubmed-4051169 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-40511692014-06-17 Identification of biomarkers that distinguish chemical contaminants based on gene expression profiles Wei, Xiaomou Ai, Junmei Deng, Youping Guan, Xin Johnson, David R Ang, Choo Y Zhang, Chaoyang Perkins, Edward J BMC Genomics Research Article BACKGROUND: High throughput transcriptomics profiles such as those generated using microarrays have been useful in identifying biomarkers for different classification and toxicity prediction purposes. Here, we investigated the use of microarrays to predict chemical toxicants and their possible mechanisms of action. RESULTS: In this study, in vitro cultures of primary rat hepatocytes were exposed to 105 chemicals and vehicle controls, representing 14 compound classes. We comprehensively compared various normalization of gene expression profiles, feature selection and classification algorithms for the classification of these 105 chemicals into14 compound classes. We found that normalization had little effect on the averaged classification accuracy. Two support vector machine (SVM) methods, LibSVM and sequential minimal optimization, had better classification performance than other methods. SVM recursive feature selection (SVM-RFE) had the highest overfitting rate when an independent dataset was used for a prediction. Therefore, we developed a new feature selection algorithm called gradient method that had a relatively high training classification as well as prediction accuracy with the lowest overfitting rate of the methods tested. Analysis of biomarkers that distinguished the 14 classes of compounds identified a group of genes principally involved in cell cycle function that were significantly downregulated by metal and inflammatory compounds, but were induced by anti-microbial, cancer related drugs, pesticides, and PXR mediators. CONCLUSIONS: Our results indicate that using microarrays and a supervised machine learning approach to predict chemical toxicants, their potential toxicity and mechanisms of action is practical and efficient. Choosing the right feature and classification algorithms for this multiple category classification and prediction is critical. BioMed Central 2014-03-31 /pmc/articles/PMC4051169/ /pubmed/24678894 http://dx.doi.org/10.1186/1471-2164-15-248 Text en Copyright © 2014 Wei et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. |
spellingShingle | Research Article Wei, Xiaomou Ai, Junmei Deng, Youping Guan, Xin Johnson, David R Ang, Choo Y Zhang, Chaoyang Perkins, Edward J Identification of biomarkers that distinguish chemical contaminants based on gene expression profiles |
title | Identification of biomarkers that distinguish chemical contaminants based on gene expression profiles |
title_full | Identification of biomarkers that distinguish chemical contaminants based on gene expression profiles |
title_fullStr | Identification of biomarkers that distinguish chemical contaminants based on gene expression profiles |
title_full_unstemmed | Identification of biomarkers that distinguish chemical contaminants based on gene expression profiles |
title_short | Identification of biomarkers that distinguish chemical contaminants based on gene expression profiles |
title_sort | identification of biomarkers that distinguish chemical contaminants based on gene expression profiles |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4051169/ https://www.ncbi.nlm.nih.gov/pubmed/24678894 http://dx.doi.org/10.1186/1471-2164-15-248 |
work_keys_str_mv | AT weixiaomou identificationofbiomarkersthatdistinguishchemicalcontaminantsbasedongeneexpressionprofiles AT aijunmei identificationofbiomarkersthatdistinguishchemicalcontaminantsbasedongeneexpressionprofiles AT dengyouping identificationofbiomarkersthatdistinguishchemicalcontaminantsbasedongeneexpressionprofiles AT guanxin identificationofbiomarkersthatdistinguishchemicalcontaminantsbasedongeneexpressionprofiles AT johnsondavidr identificationofbiomarkersthatdistinguishchemicalcontaminantsbasedongeneexpressionprofiles AT angchooy identificationofbiomarkersthatdistinguishchemicalcontaminantsbasedongeneexpressionprofiles AT zhangchaoyang identificationofbiomarkersthatdistinguishchemicalcontaminantsbasedongeneexpressionprofiles AT perkinsedwardj identificationofbiomarkersthatdistinguishchemicalcontaminantsbasedongeneexpressionprofiles |