Cargando…

Identification of biomarkers that distinguish chemical contaminants based on gene expression profiles

BACKGROUND: High throughput transcriptomics profiles such as those generated using microarrays have been useful in identifying biomarkers for different classification and toxicity prediction purposes. Here, we investigated the use of microarrays to predict chemical toxicants and their possible mecha...

Descripción completa

Detalles Bibliográficos
Autores principales: Wei, Xiaomou, Ai, Junmei, Deng, Youping, Guan, Xin, Johnson, David R, Ang, Choo Y, Zhang, Chaoyang, Perkins, Edward J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4051169/
https://www.ncbi.nlm.nih.gov/pubmed/24678894
http://dx.doi.org/10.1186/1471-2164-15-248
_version_ 1782320071286194176
author Wei, Xiaomou
Ai, Junmei
Deng, Youping
Guan, Xin
Johnson, David R
Ang, Choo Y
Zhang, Chaoyang
Perkins, Edward J
author_facet Wei, Xiaomou
Ai, Junmei
Deng, Youping
Guan, Xin
Johnson, David R
Ang, Choo Y
Zhang, Chaoyang
Perkins, Edward J
author_sort Wei, Xiaomou
collection PubMed
description BACKGROUND: High throughput transcriptomics profiles such as those generated using microarrays have been useful in identifying biomarkers for different classification and toxicity prediction purposes. Here, we investigated the use of microarrays to predict chemical toxicants and their possible mechanisms of action. RESULTS: In this study, in vitro cultures of primary rat hepatocytes were exposed to 105 chemicals and vehicle controls, representing 14 compound classes. We comprehensively compared various normalization of gene expression profiles, feature selection and classification algorithms for the classification of these 105 chemicals into14 compound classes. We found that normalization had little effect on the averaged classification accuracy. Two support vector machine (SVM) methods, LibSVM and sequential minimal optimization, had better classification performance than other methods. SVM recursive feature selection (SVM-RFE) had the highest overfitting rate when an independent dataset was used for a prediction. Therefore, we developed a new feature selection algorithm called gradient method that had a relatively high training classification as well as prediction accuracy with the lowest overfitting rate of the methods tested. Analysis of biomarkers that distinguished the 14 classes of compounds identified a group of genes principally involved in cell cycle function that were significantly downregulated by metal and inflammatory compounds, but were induced by anti-microbial, cancer related drugs, pesticides, and PXR mediators. CONCLUSIONS: Our results indicate that using microarrays and a supervised machine learning approach to predict chemical toxicants, their potential toxicity and mechanisms of action is practical and efficient. Choosing the right feature and classification algorithms for this multiple category classification and prediction is critical.
format Online
Article
Text
id pubmed-4051169
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40511692014-06-17 Identification of biomarkers that distinguish chemical contaminants based on gene expression profiles Wei, Xiaomou Ai, Junmei Deng, Youping Guan, Xin Johnson, David R Ang, Choo Y Zhang, Chaoyang Perkins, Edward J BMC Genomics Research Article BACKGROUND: High throughput transcriptomics profiles such as those generated using microarrays have been useful in identifying biomarkers for different classification and toxicity prediction purposes. Here, we investigated the use of microarrays to predict chemical toxicants and their possible mechanisms of action. RESULTS: In this study, in vitro cultures of primary rat hepatocytes were exposed to 105 chemicals and vehicle controls, representing 14 compound classes. We comprehensively compared various normalization of gene expression profiles, feature selection and classification algorithms for the classification of these 105 chemicals into14 compound classes. We found that normalization had little effect on the averaged classification accuracy. Two support vector machine (SVM) methods, LibSVM and sequential minimal optimization, had better classification performance than other methods. SVM recursive feature selection (SVM-RFE) had the highest overfitting rate when an independent dataset was used for a prediction. Therefore, we developed a new feature selection algorithm called gradient method that had a relatively high training classification as well as prediction accuracy with the lowest overfitting rate of the methods tested. Analysis of biomarkers that distinguished the 14 classes of compounds identified a group of genes principally involved in cell cycle function that were significantly downregulated by metal and inflammatory compounds, but were induced by anti-microbial, cancer related drugs, pesticides, and PXR mediators. CONCLUSIONS: Our results indicate that using microarrays and a supervised machine learning approach to predict chemical toxicants, their potential toxicity and mechanisms of action is practical and efficient. Choosing the right feature and classification algorithms for this multiple category classification and prediction is critical. BioMed Central 2014-03-31 /pmc/articles/PMC4051169/ /pubmed/24678894 http://dx.doi.org/10.1186/1471-2164-15-248 Text en Copyright © 2014 Wei et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited.
spellingShingle Research Article
Wei, Xiaomou
Ai, Junmei
Deng, Youping
Guan, Xin
Johnson, David R
Ang, Choo Y
Zhang, Chaoyang
Perkins, Edward J
Identification of biomarkers that distinguish chemical contaminants based on gene expression profiles
title Identification of biomarkers that distinguish chemical contaminants based on gene expression profiles
title_full Identification of biomarkers that distinguish chemical contaminants based on gene expression profiles
title_fullStr Identification of biomarkers that distinguish chemical contaminants based on gene expression profiles
title_full_unstemmed Identification of biomarkers that distinguish chemical contaminants based on gene expression profiles
title_short Identification of biomarkers that distinguish chemical contaminants based on gene expression profiles
title_sort identification of biomarkers that distinguish chemical contaminants based on gene expression profiles
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4051169/
https://www.ncbi.nlm.nih.gov/pubmed/24678894
http://dx.doi.org/10.1186/1471-2164-15-248
work_keys_str_mv AT weixiaomou identificationofbiomarkersthatdistinguishchemicalcontaminantsbasedongeneexpressionprofiles
AT aijunmei identificationofbiomarkersthatdistinguishchemicalcontaminantsbasedongeneexpressionprofiles
AT dengyouping identificationofbiomarkersthatdistinguishchemicalcontaminantsbasedongeneexpressionprofiles
AT guanxin identificationofbiomarkersthatdistinguishchemicalcontaminantsbasedongeneexpressionprofiles
AT johnsondavidr identificationofbiomarkersthatdistinguishchemicalcontaminantsbasedongeneexpressionprofiles
AT angchooy identificationofbiomarkersthatdistinguishchemicalcontaminantsbasedongeneexpressionprofiles
AT zhangchaoyang identificationofbiomarkersthatdistinguishchemicalcontaminantsbasedongeneexpressionprofiles
AT perkinsedwardj identificationofbiomarkersthatdistinguishchemicalcontaminantsbasedongeneexpressionprofiles