Cargando…

Discovery and validation of gene classifiers for endocrine-disrupting chemicals in zebrafish (danio rerio)

BACKGROUND: Development and application of transcriptomics-based gene classifiers for ecotoxicological applications lag far behind those of biomedical sciences. Many such classifiers discovered thus far lack vigorous statistical and experimental validations. A combination of genetic algorithm/suppor...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Rong-Lin, Bencic, David, Biales, Adam, Flick, Robert, Lazorchak, Jim, Villeneuve, Daniel, Ankley, Gerald T
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3469349/
https://www.ncbi.nlm.nih.gov/pubmed/22849515
http://dx.doi.org/10.1186/1471-2164-13-358
_version_ 1782246065418797056
author Wang, Rong-Lin
Bencic, David
Biales, Adam
Flick, Robert
Lazorchak, Jim
Villeneuve, Daniel
Ankley, Gerald T
author_facet Wang, Rong-Lin
Bencic, David
Biales, Adam
Flick, Robert
Lazorchak, Jim
Villeneuve, Daniel
Ankley, Gerald T
author_sort Wang, Rong-Lin
collection PubMed
description BACKGROUND: Development and application of transcriptomics-based gene classifiers for ecotoxicological applications lag far behind those of biomedical sciences. Many such classifiers discovered thus far lack vigorous statistical and experimental validations. A combination of genetic algorithm/support vector machines and genetic algorithm/K nearest neighbors was used in this study to search for classifiers of endocrine-disrupting chemicals (EDCs) in zebrafish. Searches were conducted on both tissue-specific and tissue-combined datasets, either across the entire transcriptome or within individual transcription factor (TF) networks previously linked to EDC effects. Candidate classifiers were evaluated by gene set enrichment analysis (GSEA) on both the original training data and a dedicated validation dataset. RESULTS: Multi-tissue dataset yielded no classifiers. Among the 19 chemical-tissue conditions evaluated, the transcriptome-wide searches yielded classifiers for six of them, each having approximately 20 to 30 gene features unique to a condition. Searches within individual TF networks produced classifiers for 15 chemical-tissue conditions, each containing 100 or fewer top-ranked gene features pooled from those of multiple TF networks and also unique to each condition. For the training dataset, 10 out of 11 classifiers successfully identified the gene expression profiles (GEPs) of their targeted chemical-tissue conditions by GSEA. For the validation dataset, classifiers for prochloraz-ovary and flutamide-ovary also correctly identified the GEPs of corresponding conditions while no classifier could predict the GEP from prochloraz-brain. CONCLUSIONS: The discrepancies in the performance of these classifiers were attributed in part to varying data complexity among the conditions, as measured to some degree by Fisher’s discriminant ratio statistic. This variation in data complexity could likely be compensated by adjusting sample size for individual chemical-tissue conditions, thus suggesting a need for a preliminary survey of transcriptomic responses before launching a full scale classifier discovery effort. Classifier discovery based on individual TF networks could yield more mechanistically-oriented biomarkers. GSEA proved to be a flexible and effective tool for application of gene classifiers but a similar and more refined algorithm, connectivity mapping, should also be explored. The distribution characteristics of classifiers across tissues, chemicals, and TF networks suggested a differential biological impact among the EDCs on zebrafish transcriptome involving some basic cellular functions.
format Online
Article
Text
id pubmed-3469349
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-34693492012-10-18 Discovery and validation of gene classifiers for endocrine-disrupting chemicals in zebrafish (danio rerio) Wang, Rong-Lin Bencic, David Biales, Adam Flick, Robert Lazorchak, Jim Villeneuve, Daniel Ankley, Gerald T BMC Genomics Research Article BACKGROUND: Development and application of transcriptomics-based gene classifiers for ecotoxicological applications lag far behind those of biomedical sciences. Many such classifiers discovered thus far lack vigorous statistical and experimental validations. A combination of genetic algorithm/support vector machines and genetic algorithm/K nearest neighbors was used in this study to search for classifiers of endocrine-disrupting chemicals (EDCs) in zebrafish. Searches were conducted on both tissue-specific and tissue-combined datasets, either across the entire transcriptome or within individual transcription factor (TF) networks previously linked to EDC effects. Candidate classifiers were evaluated by gene set enrichment analysis (GSEA) on both the original training data and a dedicated validation dataset. RESULTS: Multi-tissue dataset yielded no classifiers. Among the 19 chemical-tissue conditions evaluated, the transcriptome-wide searches yielded classifiers for six of them, each having approximately 20 to 30 gene features unique to a condition. Searches within individual TF networks produced classifiers for 15 chemical-tissue conditions, each containing 100 or fewer top-ranked gene features pooled from those of multiple TF networks and also unique to each condition. For the training dataset, 10 out of 11 classifiers successfully identified the gene expression profiles (GEPs) of their targeted chemical-tissue conditions by GSEA. For the validation dataset, classifiers for prochloraz-ovary and flutamide-ovary also correctly identified the GEPs of corresponding conditions while no classifier could predict the GEP from prochloraz-brain. CONCLUSIONS: The discrepancies in the performance of these classifiers were attributed in part to varying data complexity among the conditions, as measured to some degree by Fisher’s discriminant ratio statistic. This variation in data complexity could likely be compensated by adjusting sample size for individual chemical-tissue conditions, thus suggesting a need for a preliminary survey of transcriptomic responses before launching a full scale classifier discovery effort. Classifier discovery based on individual TF networks could yield more mechanistically-oriented biomarkers. GSEA proved to be a flexible and effective tool for application of gene classifiers but a similar and more refined algorithm, connectivity mapping, should also be explored. The distribution characteristics of classifiers across tissues, chemicals, and TF networks suggested a differential biological impact among the EDCs on zebrafish transcriptome involving some basic cellular functions. BioMed Central 2012-08-01 /pmc/articles/PMC3469349/ /pubmed/22849515 http://dx.doi.org/10.1186/1471-2164-13-358 Text en Copyright ©2012 Wang et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Wang, Rong-Lin
Bencic, David
Biales, Adam
Flick, Robert
Lazorchak, Jim
Villeneuve, Daniel
Ankley, Gerald T
Discovery and validation of gene classifiers for endocrine-disrupting chemicals in zebrafish (danio rerio)
title Discovery and validation of gene classifiers for endocrine-disrupting chemicals in zebrafish (danio rerio)
title_full Discovery and validation of gene classifiers for endocrine-disrupting chemicals in zebrafish (danio rerio)
title_fullStr Discovery and validation of gene classifiers for endocrine-disrupting chemicals in zebrafish (danio rerio)
title_full_unstemmed Discovery and validation of gene classifiers for endocrine-disrupting chemicals in zebrafish (danio rerio)
title_short Discovery and validation of gene classifiers for endocrine-disrupting chemicals in zebrafish (danio rerio)
title_sort discovery and validation of gene classifiers for endocrine-disrupting chemicals in zebrafish (danio rerio)
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3469349/
https://www.ncbi.nlm.nih.gov/pubmed/22849515
http://dx.doi.org/10.1186/1471-2164-13-358
work_keys_str_mv AT wangronglin discoveryandvalidationofgeneclassifiersforendocrinedisruptingchemicalsinzebrafishdaniorerio
AT bencicdavid discoveryandvalidationofgeneclassifiersforendocrinedisruptingchemicalsinzebrafishdaniorerio
AT bialesadam discoveryandvalidationofgeneclassifiersforendocrinedisruptingchemicalsinzebrafishdaniorerio
AT flickrobert discoveryandvalidationofgeneclassifiersforendocrinedisruptingchemicalsinzebrafishdaniorerio
AT lazorchakjim discoveryandvalidationofgeneclassifiersforendocrinedisruptingchemicalsinzebrafishdaniorerio
AT villeneuvedaniel discoveryandvalidationofgeneclassifiersforendocrinedisruptingchemicalsinzebrafishdaniorerio
AT ankleygeraldt discoveryandvalidationofgeneclassifiersforendocrinedisruptingchemicalsinzebrafishdaniorerio