Cargando…

Overview of the BioCreative VI text-mining services for Kinome Curation Track

The text-mining services for kinome curation track, part of BioCreative VI, proposed a competition to assess the effectiveness of text mining to perform literature triage. The track has exploited an unpublished curated data set from the neXtProt database. This data set contained comprehensive annota...

Descripción completa

Detalles Bibliográficos
Autores principales: Gobeill, Julien, Gaudet, Pascale, Dopp, Daniel, Morrone, Adam, Kahanda, Indika, Hsu, Yi-Yu, Wei, Chih-Hsuan, Lu, Zhiyong, Ruch, Patrick
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6191643/
https://www.ncbi.nlm.nih.gov/pubmed/30329035
http://dx.doi.org/10.1093/database/bay104
_version_ 1783363753685811200
author Gobeill, Julien
Gaudet, Pascale
Dopp, Daniel
Morrone, Adam
Kahanda, Indika
Hsu, Yi-Yu
Wei, Chih-Hsuan
Lu, Zhiyong
Ruch, Patrick
author_facet Gobeill, Julien
Gaudet, Pascale
Dopp, Daniel
Morrone, Adam
Kahanda, Indika
Hsu, Yi-Yu
Wei, Chih-Hsuan
Lu, Zhiyong
Ruch, Patrick
author_sort Gobeill, Julien
collection PubMed
description The text-mining services for kinome curation track, part of BioCreative VI, proposed a competition to assess the effectiveness of text mining to perform literature triage. The track has exploited an unpublished curated data set from the neXtProt database. This data set contained comprehensive annotations for 300 human protein kinases. For a given protein and a given curation axis [diseases or gene ontology (GO) biological processes], participants’ systems had to identify and rank relevant articles in a collection of 5.2 M MEDLINE citations (task 1) or 530 000 full-text articles (task 2). Explored strategies comprised named-entity recognition and machine-learning frameworks. For that latter approach, participants developed methods to derive a set of negative instances, as the databases typically do not store articles that were judged as irrelevant by curators. The supervised approaches proposed by the participating groups achieved significant improvements compared to the baseline established in a previous study and compared to a basic PubMed search.
format Online
Article
Text
id pubmed-6191643
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-61916432018-10-23 Overview of the BioCreative VI text-mining services for Kinome Curation Track Gobeill, Julien Gaudet, Pascale Dopp, Daniel Morrone, Adam Kahanda, Indika Hsu, Yi-Yu Wei, Chih-Hsuan Lu, Zhiyong Ruch, Patrick Database (Oxford) Original Article The text-mining services for kinome curation track, part of BioCreative VI, proposed a competition to assess the effectiveness of text mining to perform literature triage. The track has exploited an unpublished curated data set from the neXtProt database. This data set contained comprehensive annotations for 300 human protein kinases. For a given protein and a given curation axis [diseases or gene ontology (GO) biological processes], participants’ systems had to identify and rank relevant articles in a collection of 5.2 M MEDLINE citations (task 1) or 530 000 full-text articles (task 2). Explored strategies comprised named-entity recognition and machine-learning frameworks. For that latter approach, participants developed methods to derive a set of negative instances, as the databases typically do not store articles that were judged as irrelevant by curators. The supervised approaches proposed by the participating groups achieved significant improvements compared to the baseline established in a previous study and compared to a basic PubMed search. Oxford University Press 2018-10-17 /pmc/articles/PMC6191643/ /pubmed/30329035 http://dx.doi.org/10.1093/database/bay104 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Gobeill, Julien
Gaudet, Pascale
Dopp, Daniel
Morrone, Adam
Kahanda, Indika
Hsu, Yi-Yu
Wei, Chih-Hsuan
Lu, Zhiyong
Ruch, Patrick
Overview of the BioCreative VI text-mining services for Kinome Curation Track
title Overview of the BioCreative VI text-mining services for Kinome Curation Track
title_full Overview of the BioCreative VI text-mining services for Kinome Curation Track
title_fullStr Overview of the BioCreative VI text-mining services for Kinome Curation Track
title_full_unstemmed Overview of the BioCreative VI text-mining services for Kinome Curation Track
title_short Overview of the BioCreative VI text-mining services for Kinome Curation Track
title_sort overview of the biocreative vi text-mining services for kinome curation track
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6191643/
https://www.ncbi.nlm.nih.gov/pubmed/30329035
http://dx.doi.org/10.1093/database/bay104
work_keys_str_mv AT gobeilljulien overviewofthebiocreativevitextminingservicesforkinomecurationtrack
AT gaudetpascale overviewofthebiocreativevitextminingservicesforkinomecurationtrack
AT doppdaniel overviewofthebiocreativevitextminingservicesforkinomecurationtrack
AT morroneadam overviewofthebiocreativevitextminingservicesforkinomecurationtrack
AT kahandaindika overviewofthebiocreativevitextminingservicesforkinomecurationtrack
AT hsuyiyu overviewofthebiocreativevitextminingservicesforkinomecurationtrack
AT weichihhsuan overviewofthebiocreativevitextminingservicesforkinomecurationtrack
AT luzhiyong overviewofthebiocreativevitextminingservicesforkinomecurationtrack
AT ruchpatrick overviewofthebiocreativevitextminingservicesforkinomecurationtrack