Cargando…

PubTerm: a web tool for organizing, annotating and curating genes, diseases, molecules and other concepts from PubMed records

BACKGROUND AND OBJECTIVE: Analysis, annotation and curation of biomedical scientific literature is a recurrent task in biomedical research, database curation and clinics. Commonly, the reading is centered on concepts such as genes, diseases or molecules. Database curators may also need to annotate p...

Descripción completa

Detalles Bibliográficos
Autores principales: Garcia-Pelaez, José, Rodriguez, David, Medina-Molina, Roberto, Garcia-Rivas, Gerardo, Jerjes-Sánchez, Carlos, Trevino, Victor
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6323318/
https://www.ncbi.nlm.nih.gov/pubmed/30624653
http://dx.doi.org/10.1093/database/bay137
_version_ 1783385739775442944
author Garcia-Pelaez, José
Rodriguez, David
Medina-Molina, Roberto
Garcia-Rivas, Gerardo
Jerjes-Sánchez, Carlos
Trevino, Victor
author_facet Garcia-Pelaez, José
Rodriguez, David
Medina-Molina, Roberto
Garcia-Rivas, Gerardo
Jerjes-Sánchez, Carlos
Trevino, Victor
author_sort Garcia-Pelaez, José
collection PubMed
description BACKGROUND AND OBJECTIVE: Analysis, annotation and curation of biomedical scientific literature is a recurrent task in biomedical research, database curation and clinics. Commonly, the reading is centered on concepts such as genes, diseases or molecules. Database curators may also need to annotate published abstracts related to a specific topic. However, few free and intuitive tools exist to assist users in this context. Therefore, we developed PubTerm, a web tool to organize, categorize, curate and annotate a large number of PubMed abstracts related to biological entities such as genes, diseases, chemicals, species, sequence variants and other related information. METHODS: A variety of interfaces were implemented to facilitate curation and annotation, including the organization of abstracts by terms, by the co-occurrence of terms or by specific phrases. Information includes statistics on the occurrence of terms. The abstracts, terms and other related information can be annotated and categorized using user-defined categories. The session information can be saved and restored, and the data can be exported to other formats. RESULTS: The pipeline in PubTerm starts by specifying a PubMed query or list of PubMed identifiers. Then, the user can specify three lists of categories and specify what information will be highlighted in which colors. The user then utilizes the `term view’ to organize the abstracts by gene, disease, species or other information to facilitate the annotation and categorization of terms or abstracts. Other views also facilitate the exploration of abstracts and connections between terms. We have used PubTerm to quickly and efficiently curate collections of more than 400 abstracts that mention more than 350 genes to generate revised lists of susceptibility genes for diseases. An example is provided for pulmonary arterial hypertension. CONCLUSIONS: PubTerm saves time for literature revision by assisting with annotation organization and knowledge acquisition.
format Online
Article
Text
id pubmed-6323318
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-63233182019-01-10 PubTerm: a web tool for organizing, annotating and curating genes, diseases, molecules and other concepts from PubMed records Garcia-Pelaez, José Rodriguez, David Medina-Molina, Roberto Garcia-Rivas, Gerardo Jerjes-Sánchez, Carlos Trevino, Victor Database (Oxford) Database Tool BACKGROUND AND OBJECTIVE: Analysis, annotation and curation of biomedical scientific literature is a recurrent task in biomedical research, database curation and clinics. Commonly, the reading is centered on concepts such as genes, diseases or molecules. Database curators may also need to annotate published abstracts related to a specific topic. However, few free and intuitive tools exist to assist users in this context. Therefore, we developed PubTerm, a web tool to organize, categorize, curate and annotate a large number of PubMed abstracts related to biological entities such as genes, diseases, chemicals, species, sequence variants and other related information. METHODS: A variety of interfaces were implemented to facilitate curation and annotation, including the organization of abstracts by terms, by the co-occurrence of terms or by specific phrases. Information includes statistics on the occurrence of terms. The abstracts, terms and other related information can be annotated and categorized using user-defined categories. The session information can be saved and restored, and the data can be exported to other formats. RESULTS: The pipeline in PubTerm starts by specifying a PubMed query or list of PubMed identifiers. Then, the user can specify three lists of categories and specify what information will be highlighted in which colors. The user then utilizes the `term view’ to organize the abstracts by gene, disease, species or other information to facilitate the annotation and categorization of terms or abstracts. Other views also facilitate the exploration of abstracts and connections between terms. We have used PubTerm to quickly and efficiently curate collections of more than 400 abstracts that mention more than 350 genes to generate revised lists of susceptibility genes for diseases. An example is provided for pulmonary arterial hypertension. CONCLUSIONS: PubTerm saves time for literature revision by assisting with annotation organization and knowledge acquisition. Oxford University Press 2019-01-08 /pmc/articles/PMC6323318/ /pubmed/30624653 http://dx.doi.org/10.1093/database/bay137 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Database Tool
Garcia-Pelaez, José
Rodriguez, David
Medina-Molina, Roberto
Garcia-Rivas, Gerardo
Jerjes-Sánchez, Carlos
Trevino, Victor
PubTerm: a web tool for organizing, annotating and curating genes, diseases, molecules and other concepts from PubMed records
title PubTerm: a web tool for organizing, annotating and curating genes, diseases, molecules and other concepts from PubMed records
title_full PubTerm: a web tool for organizing, annotating and curating genes, diseases, molecules and other concepts from PubMed records
title_fullStr PubTerm: a web tool for organizing, annotating and curating genes, diseases, molecules and other concepts from PubMed records
title_full_unstemmed PubTerm: a web tool for organizing, annotating and curating genes, diseases, molecules and other concepts from PubMed records
title_short PubTerm: a web tool for organizing, annotating and curating genes, diseases, molecules and other concepts from PubMed records
title_sort pubterm: a web tool for organizing, annotating and curating genes, diseases, molecules and other concepts from pubmed records
topic Database Tool
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6323318/
https://www.ncbi.nlm.nih.gov/pubmed/30624653
http://dx.doi.org/10.1093/database/bay137
work_keys_str_mv AT garciapelaezjose pubtermawebtoolfororganizingannotatingandcuratinggenesdiseasesmoleculesandotherconceptsfrompubmedrecords
AT rodriguezdavid pubtermawebtoolfororganizingannotatingandcuratinggenesdiseasesmoleculesandotherconceptsfrompubmedrecords
AT medinamolinaroberto pubtermawebtoolfororganizingannotatingandcuratinggenesdiseasesmoleculesandotherconceptsfrompubmedrecords
AT garciarivasgerardo pubtermawebtoolfororganizingannotatingandcuratinggenesdiseasesmoleculesandotherconceptsfrompubmedrecords
AT jerjessanchezcarlos pubtermawebtoolfororganizingannotatingandcuratinggenesdiseasesmoleculesandotherconceptsfrompubmedrecords
AT trevinovictor pubtermawebtoolfororganizingannotatingandcuratinggenesdiseasesmoleculesandotherconceptsfrompubmedrecords