Cargando…

Text mining and expert curation to develop a database on psychiatric diseases and their genes

Psychiatric disorders constitute one of the main causes of disability worldwide. During the past years, considerable research has been conducted on the genetic architecture of such diseases, although little understanding of their etiology has been achieved. The difficulty to access up-to-date, relev...

Descripción completa

Detalles Bibliográficos
Autores principales: Gutiérrez-Sacristán, Alba, Bravo, Àlex, Portero-Tresserra, Marta, Valverde, Olga, Armario, Antonio, Blanco-Gandía, M.C., Farré, Adriana, Fernández-Ibarrondo, Lierni, Fonseca, Francina, Giraldo, Jesús, Leis, Angela, Mané, Anna, Mayer, M.A., Montagud-Romero, Sandra, Nadal, Roser, Ortiz, Jordi, Pavon, Francisco Javier, Perez, Ezequiel Jesús, Rodríguez-Arias, Marta, Serrano, Antonia, Torrens, Marta, Warnault, Vincent, Sanz, Ferran, Furlong, Laura I.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5502359/
https://www.ncbi.nlm.nih.gov/pubmed/29220439
http://dx.doi.org/10.1093/database/bax043
_version_ 1783248941028999168
author Gutiérrez-Sacristán, Alba
Bravo, Àlex
Portero-Tresserra, Marta
Valverde, Olga
Armario, Antonio
Blanco-Gandía, M.C.
Farré, Adriana
Fernández-Ibarrondo, Lierni
Fonseca, Francina
Giraldo, Jesús
Leis, Angela
Mané, Anna
Mayer, M.A.
Montagud-Romero, Sandra
Nadal, Roser
Ortiz, Jordi
Pavon, Francisco Javier
Perez, Ezequiel Jesús
Rodríguez-Arias, Marta
Serrano, Antonia
Torrens, Marta
Warnault, Vincent
Sanz, Ferran
Furlong, Laura I.
author_facet Gutiérrez-Sacristán, Alba
Bravo, Àlex
Portero-Tresserra, Marta
Valverde, Olga
Armario, Antonio
Blanco-Gandía, M.C.
Farré, Adriana
Fernández-Ibarrondo, Lierni
Fonseca, Francina
Giraldo, Jesús
Leis, Angela
Mané, Anna
Mayer, M.A.
Montagud-Romero, Sandra
Nadal, Roser
Ortiz, Jordi
Pavon, Francisco Javier
Perez, Ezequiel Jesús
Rodríguez-Arias, Marta
Serrano, Antonia
Torrens, Marta
Warnault, Vincent
Sanz, Ferran
Furlong, Laura I.
author_sort Gutiérrez-Sacristán, Alba
collection PubMed
description Psychiatric disorders constitute one of the main causes of disability worldwide. During the past years, considerable research has been conducted on the genetic architecture of such diseases, although little understanding of their etiology has been achieved. The difficulty to access up-to-date, relevant genotype-phenotype information has hampered the application of this wealth of knowledge to translational research and clinical practice in order to improve diagnosis and treatment of psychiatric patients. PsyGeNET (http://www.psygenet.org/) has been developed with the aim of supporting research on the genetic architecture of psychiatric diseases, by providing integrated and structured accessibility to their genotype–phenotype association data, together with analysis and visualization tools. In this article, we describe the protocol developed for the sustainable update of this knowledge resource. It includes the recruitment of a team of domain experts in order to perform the curation of the data extracted by text mining. Annotation guidelines and a web-based annotation tool were developed to support the curators’ tasks. A curation workflow was designed including a pilot phase and two rounds of curation and analysis phases. Negative evidence from the literature on gene–disease associations (GDAs) was taken into account in the curation process. We report the results of the application of this workflow to the curation of GDAs for PsyGeNET, including the analysis of the inter-annotator agreement and suggest this model as a suitable approach for the sustainable development and update of knowledge resources. Database URL: http://www.psygenet.org PsyGeNET corpus: http://www.psygenet.org/ds/PsyGeNET/results/psygenetCorpus.tar
format Online
Article
Text
id pubmed-5502359
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-55023592017-07-20 Text mining and expert curation to develop a database on psychiatric diseases and their genes Gutiérrez-Sacristán, Alba Bravo, Àlex Portero-Tresserra, Marta Valverde, Olga Armario, Antonio Blanco-Gandía, M.C. Farré, Adriana Fernández-Ibarrondo, Lierni Fonseca, Francina Giraldo, Jesús Leis, Angela Mané, Anna Mayer, M.A. Montagud-Romero, Sandra Nadal, Roser Ortiz, Jordi Pavon, Francisco Javier Perez, Ezequiel Jesús Rodríguez-Arias, Marta Serrano, Antonia Torrens, Marta Warnault, Vincent Sanz, Ferran Furlong, Laura I. Database (Oxford) Original Article Psychiatric disorders constitute one of the main causes of disability worldwide. During the past years, considerable research has been conducted on the genetic architecture of such diseases, although little understanding of their etiology has been achieved. The difficulty to access up-to-date, relevant genotype-phenotype information has hampered the application of this wealth of knowledge to translational research and clinical practice in order to improve diagnosis and treatment of psychiatric patients. PsyGeNET (http://www.psygenet.org/) has been developed with the aim of supporting research on the genetic architecture of psychiatric diseases, by providing integrated and structured accessibility to their genotype–phenotype association data, together with analysis and visualization tools. In this article, we describe the protocol developed for the sustainable update of this knowledge resource. It includes the recruitment of a team of domain experts in order to perform the curation of the data extracted by text mining. Annotation guidelines and a web-based annotation tool were developed to support the curators’ tasks. A curation workflow was designed including a pilot phase and two rounds of curation and analysis phases. Negative evidence from the literature on gene–disease associations (GDAs) was taken into account in the curation process. We report the results of the application of this workflow to the curation of GDAs for PsyGeNET, including the analysis of the inter-annotator agreement and suggest this model as a suitable approach for the sustainable development and update of knowledge resources. Database URL: http://www.psygenet.org PsyGeNET corpus: http://www.psygenet.org/ds/PsyGeNET/results/psygenetCorpus.tar Oxford University Press 2017-06-26 /pmc/articles/PMC5502359/ /pubmed/29220439 http://dx.doi.org/10.1093/database/bax043 Text en © The Author(s) 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Gutiérrez-Sacristán, Alba
Bravo, Àlex
Portero-Tresserra, Marta
Valverde, Olga
Armario, Antonio
Blanco-Gandía, M.C.
Farré, Adriana
Fernández-Ibarrondo, Lierni
Fonseca, Francina
Giraldo, Jesús
Leis, Angela
Mané, Anna
Mayer, M.A.
Montagud-Romero, Sandra
Nadal, Roser
Ortiz, Jordi
Pavon, Francisco Javier
Perez, Ezequiel Jesús
Rodríguez-Arias, Marta
Serrano, Antonia
Torrens, Marta
Warnault, Vincent
Sanz, Ferran
Furlong, Laura I.
Text mining and expert curation to develop a database on psychiatric diseases and their genes
title Text mining and expert curation to develop a database on psychiatric diseases and their genes
title_full Text mining and expert curation to develop a database on psychiatric diseases and their genes
title_fullStr Text mining and expert curation to develop a database on psychiatric diseases and their genes
title_full_unstemmed Text mining and expert curation to develop a database on psychiatric diseases and their genes
title_short Text mining and expert curation to develop a database on psychiatric diseases and their genes
title_sort text mining and expert curation to develop a database on psychiatric diseases and their genes
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5502359/
https://www.ncbi.nlm.nih.gov/pubmed/29220439
http://dx.doi.org/10.1093/database/bax043
work_keys_str_mv AT gutierrezsacristanalba textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes
AT bravoalex textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes
AT porterotresserramarta textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes
AT valverdeolga textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes
AT armarioantonio textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes
AT blancogandiamc textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes
AT farreadriana textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes
AT fernandezibarrondolierni textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes
AT fonsecafrancina textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes
AT giraldojesus textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes
AT leisangela textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes
AT maneanna textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes
AT mayerma textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes
AT montagudromerosandra textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes
AT nadalroser textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes
AT ortizjordi textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes
AT pavonfranciscojavier textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes
AT perezezequieljesus textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes
AT rodriguezariasmarta textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes
AT serranoantonia textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes
AT torrensmarta textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes
AT warnaultvincent textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes
AT sanzferran textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes
AT furlonglaurai textminingandexpertcurationtodevelopadatabaseonpsychiatricdiseasesandtheirgenes