An automatic method to generate domain-specific investigator networks using PubMed abstracts

BACKGROUND: Collaboration among investigators has become critical to scientific research. This includes ad hoc collaboration established through personal contacts as well as formal consortia established by funding agencies. Continued growth in online resources for scientific research and communicati...

Bibliographic Details
Main Authors: Yu, Wei; Yesupriya, Ajay; Wulf, Anja; Qu, Junfeng; Gwinn, Marta; Khoury, Muin J
Format: Text
Language: English
Published: BioMed Central 2007
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1931433/
https://www.ncbi.nlm.nih.gov/pubmed/17584920
http://dx.doi.org/10.1186/1472-6947-7-17
_version_ 1782134276512284672
author Yu, Wei
Yesupriya, Ajay
Wulf, Anja
Qu, Junfeng
Gwinn, Marta
Khoury, Muin J
author_sort Yu, Wei
collection PubMed
description BACKGROUND: Collaboration among investigators has become critical to scientific research. This includes ad hoc collaboration established through personal contacts as well as formal consortia established by funding agencies. Continued growth in online resources for scientific research and communication has promoted the development of highly networked research communities. Extending these networks globally requires identifying additional investigators in a given domain, profiling their research interests, and collecting current contact information. We present a novel strategy for building investigator networks dynamically and producing detailed investigator profiles using data available in PubMed abstracts. RESULTS: We developed a novel strategy to obtain detailed investigator information by automatically parsing the affiliation string in PubMed records. We illustrated the results by using a published literature database in human genome epidemiology (HuGE Pub Lit) as a test case. Our parsing strategy extracted country information from 92.1% of the affiliation strings in a random sample of PubMed records and in 97.0% of HuGE records, with accuracies of 94.0% and 91.0%, respectively. Institution information was parsed from 91.3% of the general PubMed records (accuracy 86.8%) and from 94.2% of HuGE PubMed records (accuracy 87.0%). We demonstrated the application of our approach to dynamic creation of investigator networks by creating a prototype information system containing a large database of PubMed abstracts relevant to human genome epidemiology (HuGE Pub Lit), indexed using PubMed medical subject headings converted to Unified Medical Language System concepts. Our method was able to identify 70–90% of the investigators/collaborators in three different human genetics fields; it also successfully identified 9 of 10 genetics investigators within the PREBIC network, an existing preterm birth research network.
CONCLUSION: We successfully created a web-based prototype capable of creating domain-specific investigator networks based on an application that accurately generates detailed investigator profiles from PubMed abstracts combined with robust standard vocabularies. This approach could be used for other biomedical fields to efficiently establish domain-specific investigator networks.
format Text
id pubmed-1931433
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-1931433 2007-07-24 An automatic method to generate domain-specific investigator networks using PubMed abstracts Yu, Wei Yesupriya, Ajay Wulf, Anja Qu, Junfeng Gwinn, Marta Khoury, Muin J BMC Med Inform Decis Mak Research Article BACKGROUND: Collaboration among investigators has become critical to scientific research. This includes ad hoc collaboration established through personal contacts as well as formal consortia established by funding agencies. Continued growth in online resources for scientific research and communication has promoted the development of highly networked research communities. Extending these networks globally requires identifying additional investigators in a given domain, profiling their research interests, and collecting current contact information. We present a novel strategy for building investigator networks dynamically and producing detailed investigator profiles using data available in PubMed abstracts. RESULTS: We developed a novel strategy to obtain detailed investigator information by automatically parsing the affiliation string in PubMed records. We illustrated the results by using a published literature database in human genome epidemiology (HuGE Pub Lit) as a test case. Our parsing strategy extracted country information from 92.1% of the affiliation strings in a random sample of PubMed records and in 97.0% of HuGE records, with accuracies of 94.0% and 91.0%, respectively. Institution information was parsed from 91.3% of the general PubMed records (accuracy 86.8%) and from 94.2% of HuGE PubMed records (accuracy 87.0%). We demonstrated the application of our approach to dynamic creation of investigator networks by creating a prototype information system containing a large database of PubMed abstracts relevant to human genome epidemiology (HuGE Pub Lit), indexed using PubMed medical subject headings converted to Unified Medical Language System concepts.
Our method was able to identify 70–90% of the investigators/collaborators in three different human genetics fields; it also successfully identified 9 of 10 genetics investigators within the PREBIC network, an existing preterm birth research network. CONCLUSION: We successfully created a web-based prototype capable of creating domain-specific investigator networks based on an application that accurately generates detailed investigator profiles from PubMed abstracts combined with robust standard vocabularies. This approach could be used for other biomedical fields to efficiently establish domain-specific investigator networks. BioMed Central 2007-06-20 /pmc/articles/PMC1931433/ /pubmed/17584920 http://dx.doi.org/10.1186/1472-6947-7-17 Text en Copyright © 2007 Yu et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Yu, Wei
Yesupriya, Ajay
Wulf, Anja
Qu, Junfeng
Gwinn, Marta
Khoury, Muin J
An automatic method to generate domain-specific investigator networks using PubMed abstracts
title An automatic method to generate domain-specific investigator networks using PubMed abstracts
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1931433/
https://www.ncbi.nlm.nih.gov/pubmed/17584920
http://dx.doi.org/10.1186/1472-6947-7-17
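
The abstract above describes extracting country and institution information by parsing the affiliation string of a PubMed record. This record does not include the authors' actual parsing rules, so the following is only a minimal sketch of the general idea, assuming a comma-separated affiliation string. The lookup tables, the function name parse_affiliation, and the sample affiliation are all hypothetical and made up for illustration; they are not taken from the article.

```python
import re

# Hypothetical, deliberately tiny lookup tables (assumptions for illustration).
# A real parser would need far larger country and institution vocabularies.
COUNTRIES = {
    "usa": "United States",
    "united states": "United States",
    "uk": "United Kingdom",
    "united kingdom": "United Kingdom",
    "germany": "Germany",
    "china": "China",
    "japan": "Japan",
}
INSTITUTION_HINTS = ("university", "institute", "hospital", "college", "center", "centre")

# Affiliation strings often end with a contact e-mail address; drop it first.
EMAIL_RE = re.compile(r"\S+@\S+")


def parse_affiliation(affiliation):
    """Return best-guess 'institution' and 'country' (None when not found)."""
    text = EMAIL_RE.sub("", affiliation).strip().rstrip(".").strip()
    parts = [p.strip() for p in text.split(",") if p.strip()]

    # Country: scan segments from the end, where the country usually appears.
    country = None
    for part in reversed(parts):
        key = part.lower().rstrip(".")
        if key in COUNTRIES:
            country = COUNTRIES[key]
            break

    # Institution: first segment containing an institution-like keyword.
    institution = None
    for part in parts:
        if any(hint in part.lower() for hint in INSTITUTION_HINTS):
            institution = part
            break

    return {"institution": institution, "country": country}


if __name__ == "__main__":
    # Made-up affiliation string, not a real PubMed record.
    sample = "Department of Genetics, Example University, Springfield, USA. jane@example.org"
    print(parse_affiliation(sample))
```

A keyword-and-lookup approach like this fails on affiliations that omit the country or use an unlisted institution type, which is one plausible reason a parser of this kind would cover less than 100% of records.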