Cargando…
An automatic method to generate domain-specific investigator networks using PubMed abstracts
BACKGROUND: Collaboration among investigators has become critical to scientific research. This includes ad hoc collaboration established through personal contacts as well as formal consortia established by funding agencies. Continued growth in online resources for scientific research and communicati...
Autores principales: | , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2007
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1931433/ https://www.ncbi.nlm.nih.gov/pubmed/17584920 http://dx.doi.org/10.1186/1472-6947-7-17 |
_version_ | 1782134276512284672 |
---|---|
author | Yu, Wei Yesupriya, Ajay Wulf, Anja Qu, Junfeng Gwinn, Marta Khoury, Muin J |
author_facet | Yu, Wei Yesupriya, Ajay Wulf, Anja Qu, Junfeng Gwinn, Marta Khoury, Muin J |
author_sort | Yu, Wei |
collection | PubMed |
description | BACKGROUND: Collaboration among investigators has become critical to scientific research. This includes ad hoc collaboration established through personal contacts as well as formal consortia established by funding agencies. Continued growth in online resources for scientific research and communication has promoted the development of highly networked research communities. Extending these networks globally requires identifying additional investigators in a given domain, profiling their research interests, and collecting current contact information. We present a novel strategy for building investigator networks dynamically and producing detailed investigator profiles using data available in PubMed abstracts. RESULTS: We developed a novel strategy to obtain detailed investigator information by automatically parsing the affiliation string in PubMed records. We illustrated the results by using a published literature database in human genome epidemiology (HuGE Pub Lit) as a test case. Our parsing strategy extracted country information from 92.1% of the affiliation strings in a random sample of PubMed records and in 97.0% of HuGE records, with accuracies of 94.0% and 91.0%, respectively. Institution information was parsed from 91.3% of the general PubMed records (accuracy 86.8%) and from 94.2% of HuGE PubMed records (accuracy 87.0). We demonstrated the application of our approach to dynamic creation of investigator networks by creating a prototype information system containing a large database of PubMed abstracts relevant to human genome epidemiology (HuGE Pub Lit), indexed using PubMed medical subject headings converted to Unified Medical Language System concepts. Our method was able to identify 70–90% of the investigators/collaborators in three different human genetics fields; it also successfully identified 9 of 10 genetics investigators within the PREBIC network, an existing preterm birth research network. CONCLUSION: We successfully created a web-based prototype capable of creating domain-specific investigator networks based on an application that accurately generates detailed investigator profiles from PubMed abstracts combined with robust standard vocabularies. This approach could be used for other biomedical fields to efficiently establish domain-specific investigator networks. |
format | Text |
id | pubmed-1931433 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-19314332007-07-24 An automatic method to generate domain-specific investigator networks using PubMed abstracts Yu, Wei Yesupriya, Ajay Wulf, Anja Qu, Junfeng Gwinn, Marta Khoury, Muin J BMC Med Inform Decis Mak Research Article BACKGROUND: Collaboration among investigators has become critical to scientific research. This includes ad hoc collaboration established through personal contacts as well as formal consortia established by funding agencies. Continued growth in online resources for scientific research and communication has promoted the development of highly networked research communities. Extending these networks globally requires identifying additional investigators in a given domain, profiling their research interests, and collecting current contact information. We present a novel strategy for building investigator networks dynamically and producing detailed investigator profiles using data available in PubMed abstracts. RESULTS: We developed a novel strategy to obtain detailed investigator information by automatically parsing the affiliation string in PubMed records. We illustrated the results by using a published literature database in human genome epidemiology (HuGE Pub Lit) as a test case. Our parsing strategy extracted country information from 92.1% of the affiliation strings in a random sample of PubMed records and in 97.0% of HuGE records, with accuracies of 94.0% and 91.0%, respectively. Institution information was parsed from 91.3% of the general PubMed records (accuracy 86.8%) and from 94.2% of HuGE PubMed records (accuracy 87.0). We demonstrated the application of our approach to dynamic creation of investigator networks by creating a prototype information system containing a large database of PubMed abstracts relevant to human genome epidemiology (HuGE Pub Lit), indexed using PubMed medical subject headings converted to Unified Medical Language System concepts. Our method was able to identify 70–90% of the investigators/collaborators in three different human genetics fields; it also successfully identified 9 of 10 genetics investigators within the PREBIC network, an existing preterm birth research network. CONCLUSION: We successfully created a web-based prototype capable of creating domain-specific investigator networks based on an application that accurately generates detailed investigator profiles from PubMed abstracts combined with robust standard vocabularies. This approach could be used for other biomedical fields to efficiently establish domain-specific investigator networks. BioMed Central 2007-06-20 /pmc/articles/PMC1931433/ /pubmed/17584920 http://dx.doi.org/10.1186/1472-6947-7-17 Text en Copyright © 2007 Yu et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Yu, Wei Yesupriya, Ajay Wulf, Anja Qu, Junfeng Gwinn, Marta Khoury, Muin J An automatic method to generate domain-specific investigator networks using PubMed abstracts |
title | An automatic method to generate domain-specific investigator networks using PubMed abstracts |
title_full | An automatic method to generate domain-specific investigator networks using PubMed abstracts |
title_fullStr | An automatic method to generate domain-specific investigator networks using PubMed abstracts |
title_full_unstemmed | An automatic method to generate domain-specific investigator networks using PubMed abstracts |
title_short | An automatic method to generate domain-specific investigator networks using PubMed abstracts |
title_sort | automatic method to generate domain-specific investigator networks using pubmed abstracts |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1931433/ https://www.ncbi.nlm.nih.gov/pubmed/17584920 http://dx.doi.org/10.1186/1472-6947-7-17 |
work_keys_str_mv | AT yuwei anautomaticmethodtogeneratedomainspecificinvestigatornetworksusingpubmedabstracts AT yesupriyaajay anautomaticmethodtogeneratedomainspecificinvestigatornetworksusingpubmedabstracts AT wulfanja anautomaticmethodtogeneratedomainspecificinvestigatornetworksusingpubmedabstracts AT qujunfeng anautomaticmethodtogeneratedomainspecificinvestigatornetworksusingpubmedabstracts AT gwinnmarta anautomaticmethodtogeneratedomainspecificinvestigatornetworksusingpubmedabstracts AT khourymuinj anautomaticmethodtogeneratedomainspecificinvestigatornetworksusingpubmedabstracts AT yuwei automaticmethodtogeneratedomainspecificinvestigatornetworksusingpubmedabstracts AT yesupriyaajay automaticmethodtogeneratedomainspecificinvestigatornetworksusingpubmedabstracts AT wulfanja automaticmethodtogeneratedomainspecificinvestigatornetworksusingpubmedabstracts AT qujunfeng automaticmethodtogeneratedomainspecificinvestigatornetworksusingpubmedabstracts AT gwinnmarta automaticmethodtogeneratedomainspecificinvestigatornetworksusingpubmedabstracts AT khourymuinj automaticmethodtogeneratedomainspecificinvestigatornetworksusingpubmedabstracts |