Cargando…

CoPub Mapper: mining MEDLINE based on search term co-publication

BACKGROUND: High throughput microarray analyses result in many differentially expressed genes that are potentially responsible for the biological process of interest. In order to identify biological similarities between genes, publications from MEDLINE were identified in which pairs of gene names an...

Descripción completa

Detalles Bibliográficos
Autores principales: Alako, Blaise TF, Veldhoven, Antoine, van Baal, Sjozef, Jelier, Rob, Verhoeven, Stefan, Rullmann, Ton, Polman, Jan, Jenster, Guido
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1274248/
https://www.ncbi.nlm.nih.gov/pubmed/15760478
http://dx.doi.org/10.1186/1471-2105-6-51
_version_ 1782125972463550464
author Alako, Blaise TF
Veldhoven, Antoine
van Baal, Sjozef
Jelier, Rob
Verhoeven, Stefan
Rullmann, Ton
Polman, Jan
Jenster, Guido
author_facet Alako, Blaise TF
Veldhoven, Antoine
van Baal, Sjozef
Jelier, Rob
Verhoeven, Stefan
Rullmann, Ton
Polman, Jan
Jenster, Guido
author_sort Alako, Blaise TF
collection PubMed
description BACKGROUND: High throughput microarray analyses result in many differentially expressed genes that are potentially responsible for the biological process of interest. In order to identify biological similarities between genes, publications from MEDLINE were identified in which pairs of gene names and combinations of gene name with specific keywords were co-mentioned. RESULTS: MEDLINE search strings for 15,621 known genes and 3,731 keywords were generated and validated. PubMed IDs were retrieved from MEDLINE and relative probability of co-occurrences of all gene-gene and gene-keyword pairs determined. To assess gene clustering according to literature co-publication, 150 genes consisting of 8 sets with known connections (same pathway, same protein complex, or same cellular localization, etc.) were run through the program. Receiver operator characteristics (ROC) analyses showed that most gene sets were clustered much better than expected by random chance. To test grouping of genes from real microarray data, 221 differentially expressed genes from a microarray experiment were analyzed with CoPub Mapper, which resulted in several relevant clusters of genes with biological process and disease keywords. In addition, all genes versus keywords were hierarchical clustered to reveal a complete grouping of published genes based on co-occurrence. CONCLUSION: The CoPub Mapper program allows for quick and versatile querying of co-published genes and keywords and can be successfully used to cluster predefined groups of genes and microarray data.
format Text
id pubmed-1274248
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-12742482005-10-29 CoPub Mapper: mining MEDLINE based on search term co-publication Alako, Blaise TF Veldhoven, Antoine van Baal, Sjozef Jelier, Rob Verhoeven, Stefan Rullmann, Ton Polman, Jan Jenster, Guido BMC Bioinformatics Software BACKGROUND: High throughput microarray analyses result in many differentially expressed genes that are potentially responsible for the biological process of interest. In order to identify biological similarities between genes, publications from MEDLINE were identified in which pairs of gene names and combinations of gene name with specific keywords were co-mentioned. RESULTS: MEDLINE search strings for 15,621 known genes and 3,731 keywords were generated and validated. PubMed IDs were retrieved from MEDLINE and relative probability of co-occurrences of all gene-gene and gene-keyword pairs determined. To assess gene clustering according to literature co-publication, 150 genes consisting of 8 sets with known connections (same pathway, same protein complex, or same cellular localization, etc.) were run through the program. Receiver operator characteristics (ROC) analyses showed that most gene sets were clustered much better than expected by random chance. To test grouping of genes from real microarray data, 221 differentially expressed genes from a microarray experiment were analyzed with CoPub Mapper, which resulted in several relevant clusters of genes with biological process and disease keywords. In addition, all genes versus keywords were hierarchical clustered to reveal a complete grouping of published genes based on co-occurrence. CONCLUSION: The CoPub Mapper program allows for quick and versatile querying of co-published genes and keywords and can be successfully used to cluster predefined groups of genes and microarray data. BioMed Central 2005-03-11 /pmc/articles/PMC1274248/ /pubmed/15760478 http://dx.doi.org/10.1186/1471-2105-6-51 Text en Copyright © 2005 Alako et al; licensee BioMed Central Ltd.
spellingShingle Software
Alako, Blaise TF
Veldhoven, Antoine
van Baal, Sjozef
Jelier, Rob
Verhoeven, Stefan
Rullmann, Ton
Polman, Jan
Jenster, Guido
CoPub Mapper: mining MEDLINE based on search term co-publication
title CoPub Mapper: mining MEDLINE based on search term co-publication
title_full CoPub Mapper: mining MEDLINE based on search term co-publication
title_fullStr CoPub Mapper: mining MEDLINE based on search term co-publication
title_full_unstemmed CoPub Mapper: mining MEDLINE based on search term co-publication
title_short CoPub Mapper: mining MEDLINE based on search term co-publication
title_sort copub mapper: mining medline based on search term co-publication
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1274248/
https://www.ncbi.nlm.nih.gov/pubmed/15760478
http://dx.doi.org/10.1186/1471-2105-6-51
work_keys_str_mv AT alakoblaisetf copubmapperminingmedlinebasedonsearchtermcopublication
AT veldhovenantoine copubmapperminingmedlinebasedonsearchtermcopublication
AT vanbaalsjozef copubmapperminingmedlinebasedonsearchtermcopublication
AT jelierrob copubmapperminingmedlinebasedonsearchtermcopublication
AT verhoevenstefan copubmapperminingmedlinebasedonsearchtermcopublication
AT rullmannton copubmapperminingmedlinebasedonsearchtermcopublication
AT polmanjan copubmapperminingmedlinebasedonsearchtermcopublication
AT jensterguido copubmapperminingmedlinebasedonsearchtermcopublication