Cargando…

Of text and gene – using text mining methods to uncover hidden knowledge in toxicogenomics

BACKGROUND: Toxicogenomics studies often profile gene expression from assays involving multiple doses and time points. The dose- and time-dependent pattern is of great importance to assess toxicity but computational approaches are lacking to effectively utilize this characteristic in toxicity assess...

Descripción completa

Detalles Bibliográficos
Autores principales:	Lee, Mikyung, Liu, Zhichao, Kelly, Reagan, Tong, Weida
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2014
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4236689/ https://www.ncbi.nlm.nih.gov/pubmed/25115450 http://dx.doi.org/10.1186/s12918-014-0093-3

_version_	1782345219542351872
author	Lee, Mikyung Liu, Zhichao Kelly, Reagan Tong, Weida
author_facet	Lee, Mikyung Liu, Zhichao Kelly, Reagan Tong, Weida
author_sort	Lee, Mikyung
collection	PubMed
description	BACKGROUND: Toxicogenomics studies often profile gene expression from assays involving multiple doses and time points. The dose- and time-dependent pattern is of great importance to assess toxicity but computational approaches are lacking to effectively utilize this characteristic in toxicity assessment. Topic modeling is a text mining approach, but may be used analogously in toxicogenomics due to the similar data structures between text and gene dysregulation. RESULTS: Topic modeling was applied to a very large toxicogenomics dataset containing microarray gene expression data from >15,000 samples associated with 131 drugs tested in three different assay platforms (i.e., in vitro assay, in vivo repeated dose study and in vivo single dose experiment) with a design including multiple doses and time points. A set of “topics” which each consist of a set of genes was determined, by which the varying sensitivity of three assay systems was observed. We found that the drug-dependent effect was more pronounced in the two in vivo systems than the in vitro system, while the time-dependent effect was most strongly reflected in the in vitro system followed by the single dose study and lastly the repeated dose experiment. The dose-dependent effect was similar across three assay systems. Although the results indicated a challenge to extrapolate the in vitro results to the in vivo situation, we did notice that, for some drugs but not for all the drugs, the similarity in gene expression patterns was observed across all three assay systems, indicating a possibility of using in vitro systems with careful designs (such as the choice of dose and time point), to replace the in vivo testing strategy. Nonetheless, a potential to replace the repeated dose study by the single-dose short-term methodology was strongly implied. CONCLUSIONS: The study demonstrated that text mining methodologies such as topic modeling provide an alternative method compared to traditional means for data reduction in toxicogenomics, enhancing researchers’ capabilities to interpret biological information.
format	Online Article Text
id	pubmed-4236689
institution	National Center for Biotechnology Information
language	English
publishDate	2014
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-42366892014-11-24 Of text and gene – using text mining methods to uncover hidden knowledge in toxicogenomics Lee, Mikyung Liu, Zhichao Kelly, Reagan Tong, Weida BMC Syst Biol Research Article BACKGROUND: Toxicogenomics studies often profile gene expression from assays involving multiple doses and time points. The dose- and time-dependent pattern is of great importance to assess toxicity but computational approaches are lacking to effectively utilize this characteristic in toxicity assessment. Topic modeling is a text mining approach, but may be used analogously in toxicogenomics due to the similar data structures between text and gene dysregulation. RESULTS: Topic modeling was applied to a very large toxicogenomics dataset containing microarray gene expression data from >15,000 samples associated with 131 drugs tested in three different assay platforms (i.e., in vitro assay, in vivo repeated dose study and in vivo single dose experiment) with a design including multiple doses and time points. A set of “topics” which each consist of a set of genes was determined, by which the varying sensitivity of three assay systems was observed. We found that the drug-dependent effect was more pronounced in the two in vivo systems than the in vitro system, while the time-dependent effect was most strongly reflected in the in vitro system followed by the single dose study and lastly the repeated dose experiment. The dose-dependent effect was similar across three assay systems. Although the results indicated a challenge to extrapolate the in vitro results to the in vivo situation, we did notice that, for some drugs but not for all the drugs, the similarity in gene expression patterns was observed across all three assay systems, indicating a possibility of using in vitro systems with careful designs (such as the choice of dose and time point), to replace the in vivo testing strategy. Nonetheless, a potential to replace the repeated dose study by the single-dose short-term methodology was strongly implied. CONCLUSIONS: The study demonstrated that text mining methodologies such as topic modeling provide an alternative method compared to traditional means for data reduction in toxicogenomics, enhancing researchers’ capabilities to interpret biological information. BioMed Central 2014-08-13 /pmc/articles/PMC4236689/ /pubmed/25115450 http://dx.doi.org/10.1186/s12918-014-0093-3 Text en Copyright © 2014 Lee et al.; licensee BioMed Central http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Research Article Lee, Mikyung Liu, Zhichao Kelly, Reagan Tong, Weida Of text and gene – using text mining methods to uncover hidden knowledge in toxicogenomics
title	Of text and gene – using text mining methods to uncover hidden knowledge in toxicogenomics
title_full	Of text and gene – using text mining methods to uncover hidden knowledge in toxicogenomics
title_fullStr	Of text and gene – using text mining methods to uncover hidden knowledge in toxicogenomics
title_full_unstemmed	Of text and gene – using text mining methods to uncover hidden knowledge in toxicogenomics
title_short	Of text and gene – using text mining methods to uncover hidden knowledge in toxicogenomics
title_sort	of text and gene – using text mining methods to uncover hidden knowledge in toxicogenomics
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4236689/ https://www.ncbi.nlm.nih.gov/pubmed/25115450 http://dx.doi.org/10.1186/s12918-014-0093-3
work_keys_str_mv	AT leemikyung oftextandgeneusingtextminingmethodstouncoverhiddenknowledgeintoxicogenomics AT liuzhichao oftextandgeneusingtextminingmethodstouncoverhiddenknowledgeintoxicogenomics AT kellyreagan oftextandgeneusingtextminingmethodstouncoverhiddenknowledgeintoxicogenomics AT tongweida oftextandgeneusingtextminingmethodstouncoverhiddenknowledgeintoxicogenomics

Of text and gene – using text mining methods to uncover hidden knowledge in toxicogenomics

Ejemplares similares