Cargando…
Text mining and expert curation to develop a database on psychiatric diseases and their genes
Psychiatric disorders constitute one of the main causes of disability worldwide. During the past years, considerable research has been conducted on the genetic architecture of such diseases, although little understanding of their etiology has been achieved. The difficulty to access up-to-date, relev...
Autores principales: | , , , , , , , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5502359/ https://www.ncbi.nlm.nih.gov/pubmed/29220439 http://dx.doi.org/10.1093/database/bax043 |
_version_ | 1783248941028999168 |
---|---|
author | Gutiérrez-Sacristán, Alba Bravo, Àlex Portero-Tresserra, Marta Valverde, Olga Armario, Antonio Blanco-Gandía, M.C. Farré, Adriana Fernández-Ibarrondo, Lierni Fonseca, Francina Giraldo, Jesús Leis, Angela Mané, Anna Mayer, M.A. Montagud-Romero, Sandra Nadal, Roser Ortiz, Jordi Pavon, Francisco Javier Perez, Ezequiel Jesús Rodríguez-Arias, Marta Serrano, Antonia Torrens, Marta Warnault, Vincent Sanz, Ferran Furlong, Laura I. |
author_facet | Gutiérrez-Sacristán, Alba Bravo, Àlex Portero-Tresserra, Marta Valverde, Olga Armario, Antonio Blanco-Gandía, M.C. Farré, Adriana Fernández-Ibarrondo, Lierni Fonseca, Francina Giraldo, Jesús Leis, Angela Mané, Anna Mayer, M.A. Montagud-Romero, Sandra Nadal, Roser Ortiz, Jordi Pavon, Francisco Javier Perez, Ezequiel Jesús Rodríguez-Arias, Marta Serrano, Antonia Torrens, Marta Warnault, Vincent Sanz, Ferran Furlong, Laura I. |
author_sort | Gutiérrez-Sacristán, Alba |
collection | PubMed |
description | Psychiatric disorders constitute one of the main causes of disability worldwide. During the past years, considerable research has been conducted on the genetic architecture of such diseases, although little understanding of their etiology has been achieved. The difficulty to access up-to-date, relevant genotype-phenotype information has hampered the application of this wealth of knowledge to translational research and clinical practice in order to improve diagnosis and treatment of psychiatric patients. PsyGeNET (http://www.psygenet.org/) has been developed with the aim of supporting research on the genetic architecture of psychiatric diseases, by providing integrated and structured accessibility to their genotype–phenotype association data, together with analysis and visualization tools. In this article, we describe the protocol developed for the sustainable update of this knowledge resource. It includes the recruitment of a team of domain experts in order to perform the curation of the data extracted by text mining. Annotation guidelines and a web-based annotation tool were developed to support the curators’ tasks. A curation workflow was designed including a pilot phase and two rounds of curation and analysis phases. Negative evidence from the literature on gene–disease associations (GDAs) was taken into account in the curation process. We report the results of the application of this workflow to the curation of GDAs for PsyGeNET, including the analysis of the inter-annotator agreement and suggest this model as a suitable approach for the sustainable development and update of knowledge resources. Database URL: http://www.psygenet.org PsyGeNET corpus: http://www.psygenet.org/ds/PsyGeNET/results/psygenetCorpus.tar |
format | Online Article Text |
id | pubmed-5502359 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-55023592017-07-20 Text mining and expert curation to develop a database on psychiatric diseases and their genes Gutiérrez-Sacristán, Alba Bravo, Àlex Portero-Tresserra, Marta Valverde, Olga Armario, Antonio Blanco-Gandía, M.C. Farré, Adriana Fernández-Ibarrondo, Lierni Fonseca, Francina Giraldo, Jesús Leis, Angela Mané, Anna Mayer, M.A. Montagud-Romero, Sandra Nadal, Roser Ortiz, Jordi Pavon, Francisco Javier Perez, Ezequiel Jesús Rodríguez-Arias, Marta Serrano, Antonia Torrens, Marta Warnault, Vincent Sanz, Ferran Furlong, Laura I. Database (Oxford) Original Article Psychiatric disorders constitute one of the main causes of disability worldwide. During the past years, considerable research has been conducted on the genetic architecture of such diseases, although little understanding of their etiology has been achieved. The difficulty to access up-to-date, relevant genotype-phenotype information has hampered the application of this wealth of knowledge to translational research and clinical practice in order to improve diagnosis and treatment of psychiatric patients. PsyGeNET (http://www.psygenet.org/) has been developed with the aim of supporting research on the genetic architecture of psychiatric diseases, by providing integrated and structured accessibility to their genotype–phenotype association data, together with analysis and visualization tools. In this article, we describe the protocol developed for the sustainable update of this knowledge resource. It includes the recruitment of a team of domain experts in order to perform the curation of the data extracted by text mining. Annotation guidelines and a web-based annotation tool were developed to support the curators’ tasks. A curation workflow was designed including a pilot phase and two rounds of curation and analysis phases. Negative evidence from the literature on gene–disease associations (GDAs) was taken into account in the curation process. We report the results of the application of this workflow to the curation of GDAs for PsyGeNET, including the analysis of the inter-annotator agreement and suggest this model as a suitable approach for the sustainable development and update of knowledge resources. Database URL: http://www.psygenet.org PsyGeNET corpus: http://www.psygenet.org/ds/PsyGeNET/results/psygenetCorpus.tar Oxford University Press 2017-06-26 /pmc/articles/PMC5502359/ /pubmed/29220439 http://dx.doi.org/10.1093/database/bax043 Text en © The Author(s) 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Article Gutiérrez-Sacristán, Alba Bravo, Àlex Portero-Tresserra, Marta Valverde, Olga Armario, Antonio Blanco-Gandía, M.C. Farré, Adriana Fernández-Ibarrondo, Lierni Fonseca, Francina Giraldo, Jesús Leis, Angela Mané, Anna Mayer, M.A. Montagud-Romero, Sandra Nadal, Roser Ortiz, Jordi Pavon, Francisco Javier Perez, Ezequiel Jesús Rodríguez-Arias, Marta Serrano, Antonia Torrens, Marta Warnault, Vincent Sanz, Ferran Furlong, Laura I. Text mining and expert curation to develop a database on psychiatric diseases and their genes |
title | Text mining and expert curation to develop a database on psychiatric diseases and their genes |
title_full | Text mining and expert curation to develop a database on psychiatric diseases and their genes |
title_fullStr | Text mining and expert curation to develop a database on psychiatric diseases and their genes |
title_full_unstemmed | Text mining and expert curation to develop a database on psychiatric diseases and their genes |
title_short | Text mining and expert curation to develop a database on psychiatric diseases and their genes |
title_sort | text mining and expert curation to develop a database on psychiatric diseases and their genes |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5502359/ https://www.ncbi.nlm.nih.gov/pubmed/29220439 http://dx.doi.org/10.1093/database/bax043 |
work_keys_str_mv | AT gutierrezsacristanalba textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes AT bravoalex textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes AT porterotresserramarta textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes AT valverdeolga textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes AT armarioantonio textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes AT blancogandiamc textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes AT farreadriana textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes AT fernandezibarrondolierni textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes AT fonsecafrancina textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes AT giraldojesus textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes AT leisangela textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes AT maneanna textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes AT mayerma textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes AT montagudromerosandra textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes AT nadalroser textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes AT ortizjordi textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes AT pavonfranciscojavier textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes AT perezezequieljesus textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes AT rodriguezariasmarta textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes AT serranoantonia textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes AT torrensmarta textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes AT warnaultvincent textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes AT sanzferran textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes AT furlonglaurai textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes |