Cargando…

PubMatrix: a tool for multiplex literature mining

BACKGROUND: Molecular experiments using multiplex strategies such as cDNA microarrays or proteomic approaches generate large datasets requiring biological interpretation. Text based data mining tools have recently been developed to query large biological datasets of this type of data. PubMatrix is a...

Descripción completa

Detalles Bibliográficos
Autores principales: Becker, Kevin G, Hosack, Douglas A, Dennis, Glynn, Lempicki, Richard A, Bright, Tiffani J, Cheadle, Chris, Engel, Jim
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2003
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC317283/
https://www.ncbi.nlm.nih.gov/pubmed/14667255
http://dx.doi.org/10.1186/1471-2105-4-61
_version_ 1782121133363953664
author Becker, Kevin G
Hosack, Douglas A
Dennis, Glynn
Lempicki, Richard A
Bright, Tiffani J
Cheadle, Chris
Engel, Jim
author_facet Becker, Kevin G
Hosack, Douglas A
Dennis, Glynn
Lempicki, Richard A
Bright, Tiffani J
Cheadle, Chris
Engel, Jim
author_sort Becker, Kevin G
collection PubMed
description BACKGROUND: Molecular experiments using multiplex strategies such as cDNA microarrays or proteomic approaches generate large datasets requiring biological interpretation. Text based data mining tools have recently been developed to query large biological datasets of this type of data. PubMatrix is a web-based tool that allows simple text based mining of the NCBI literature search service PubMed using any two lists of keywords terms, resulting in a frequency matrix of term co-occurrence. RESULTS: For example, a simple term selection procedure allows automatic pair-wise comparisons of approximately 1–100 search terms versus approximately 1–10 modifier terms, resulting in up to 1,000 pair wise comparisons. The matrix table of pair-wise comparisons can then be surveyed, queried individually, and archived. Lists of keywords can include any terms currently capable of being searched in PubMed. In the context of cDNA microarray studies, this may be used for the annotation of gene lists from clusters of genes that are expressed coordinately. An associated PubMatrix public archive provides previous searches using common useful lists of keyword terms. CONCLUSIONS: In this way, lists of terms, such as gene names, or functional assignments can be assigned genetic, biological, or clinical relevance in a rapid flexible systematic fashion.
format Text
id pubmed-317283
institution National Center for Biotechnology Information
language English
publishDate 2003
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-3172832004-01-23 PubMatrix: a tool for multiplex literature mining Becker, Kevin G Hosack, Douglas A Dennis, Glynn Lempicki, Richard A Bright, Tiffani J Cheadle, Chris Engel, Jim BMC Bioinformatics Software BACKGROUND: Molecular experiments using multiplex strategies such as cDNA microarrays or proteomic approaches generate large datasets requiring biological interpretation. Text based data mining tools have recently been developed to query large biological datasets of this type of data. PubMatrix is a web-based tool that allows simple text based mining of the NCBI literature search service PubMed using any two lists of keywords terms, resulting in a frequency matrix of term co-occurrence. RESULTS: For example, a simple term selection procedure allows automatic pair-wise comparisons of approximately 1–100 search terms versus approximately 1–10 modifier terms, resulting in up to 1,000 pair wise comparisons. The matrix table of pair-wise comparisons can then be surveyed, queried individually, and archived. Lists of keywords can include any terms currently capable of being searched in PubMed. In the context of cDNA microarray studies, this may be used for the annotation of gene lists from clusters of genes that are expressed coordinately. An associated PubMatrix public archive provides previous searches using common useful lists of keyword terms. CONCLUSIONS: In this way, lists of terms, such as gene names, or functional assignments can be assigned genetic, biological, or clinical relevance in a rapid flexible systematic fashion. BioMed Central 2003-12-10 /pmc/articles/PMC317283/ /pubmed/14667255 http://dx.doi.org/10.1186/1471-2105-4-61 Text en Copyright © 2003 Becker et al; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.
spellingShingle Software
Becker, Kevin G
Hosack, Douglas A
Dennis, Glynn
Lempicki, Richard A
Bright, Tiffani J
Cheadle, Chris
Engel, Jim
PubMatrix: a tool for multiplex literature mining
title PubMatrix: a tool for multiplex literature mining
title_full PubMatrix: a tool for multiplex literature mining
title_fullStr PubMatrix: a tool for multiplex literature mining
title_full_unstemmed PubMatrix: a tool for multiplex literature mining
title_short PubMatrix: a tool for multiplex literature mining
title_sort pubmatrix: a tool for multiplex literature mining
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC317283/
https://www.ncbi.nlm.nih.gov/pubmed/14667255
http://dx.doi.org/10.1186/1471-2105-4-61
work_keys_str_mv AT beckerkeving pubmatrixatoolformultiplexliteraturemining
AT hosackdouglasa pubmatrixatoolformultiplexliteraturemining
AT dennisglynn pubmatrixatoolformultiplexliteraturemining
AT lempickiricharda pubmatrixatoolformultiplexliteraturemining
AT brighttiffanij pubmatrixatoolformultiplexliteraturemining
AT cheadlechris pubmatrixatoolformultiplexliteraturemining
AT engeljim pubmatrixatoolformultiplexliteraturemining