Cargando…

How Artificial Intelligence Can Improve Our Understanding of the Genes Associated with Endometriosis: Natural Language Processing of the PubMed Database

Endometriosis is a disease characterized by the development of endometrial tissue outside the uterus, but its cause remains largely unknown. Numerous genes have been studied and proposed to help explain its pathogenesis. However, the large number of these candidate genes has made functional validati...

Descripción completa

Detalles Bibliográficos
Autores principales: Bouaziz, J., Mashiach, R., Cohen, S., Kedem, A., Baron, A., Zajicek, M., Feldman, I., Seidman, D., Soriano, D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5884286/
https://www.ncbi.nlm.nih.gov/pubmed/29750165
http://dx.doi.org/10.1155/2018/6217812
_version_ 1783311796739768320
author Bouaziz, J.
Mashiach, R.
Cohen, S.
Kedem, A.
Baron, A.
Zajicek, M.
Feldman, I.
Seidman, D.
Soriano, D.
author_facet Bouaziz, J.
Mashiach, R.
Cohen, S.
Kedem, A.
Baron, A.
Zajicek, M.
Feldman, I.
Seidman, D.
Soriano, D.
author_sort Bouaziz, J.
collection PubMed
description Endometriosis is a disease characterized by the development of endometrial tissue outside the uterus, but its cause remains largely unknown. Numerous genes have been studied and proposed to help explain its pathogenesis. However, the large number of these candidate genes has made functional validation through experimental methodologies nearly impossible. Computational methods could provide a useful alternative for prioritizing those most likely to be susceptibility genes. Using artificial intelligence applied to text mining, this study analyzed the genes involved in the pathogenesis, development, and progression of endometriosis. The data extraction by text mining of the endometriosis-related genes in the PubMed database was based on natural language processing, and the data were filtered to remove false positives. Using data from the text mining and gene network information as input for the web-based tool, 15,207 endometriosis-related genes were ranked according to their score in the database. Characterization of the filtered gene set through gene ontology, pathway, and network analysis provided information about the numerous mechanisms hypothesized to be responsible for the establishment of ectopic endometrial tissue, as well as the migration, implantation, survival, and proliferation of ectopic endometrial cells. Finally, the human genome was scanned through various databases using filtered genes as a seed to determine novel genes that might also be involved in the pathogenesis of endometriosis but which have not yet been characterized. These genes could be promising candidates to serve as useful diagnostic biomarkers and therapeutic targets in the management of endometriosis.
format Online
Article
Text
id pubmed-5884286
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-58842862018-05-10 How Artificial Intelligence Can Improve Our Understanding of the Genes Associated with Endometriosis: Natural Language Processing of the PubMed Database Bouaziz, J. Mashiach, R. Cohen, S. Kedem, A. Baron, A. Zajicek, M. Feldman, I. Seidman, D. Soriano, D. Biomed Res Int Review Article Endometriosis is a disease characterized by the development of endometrial tissue outside the uterus, but its cause remains largely unknown. Numerous genes have been studied and proposed to help explain its pathogenesis. However, the large number of these candidate genes has made functional validation through experimental methodologies nearly impossible. Computational methods could provide a useful alternative for prioritizing those most likely to be susceptibility genes. Using artificial intelligence applied to text mining, this study analyzed the genes involved in the pathogenesis, development, and progression of endometriosis. The data extraction by text mining of the endometriosis-related genes in the PubMed database was based on natural language processing, and the data were filtered to remove false positives. Using data from the text mining and gene network information as input for the web-based tool, 15,207 endometriosis-related genes were ranked according to their score in the database. Characterization of the filtered gene set through gene ontology, pathway, and network analysis provided information about the numerous mechanisms hypothesized to be responsible for the establishment of ectopic endometrial tissue, as well as the migration, implantation, survival, and proliferation of ectopic endometrial cells. Finally, the human genome was scanned through various databases using filtered genes as a seed to determine novel genes that might also be involved in the pathogenesis of endometriosis but which have not yet been characterized. These genes could be promising candidates to serve as useful diagnostic biomarkers and therapeutic targets in the management of endometriosis. Hindawi 2018-03-20 /pmc/articles/PMC5884286/ /pubmed/29750165 http://dx.doi.org/10.1155/2018/6217812 Text en Copyright © 2018 J. Bouaziz et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Review Article
Bouaziz, J.
Mashiach, R.
Cohen, S.
Kedem, A.
Baron, A.
Zajicek, M.
Feldman, I.
Seidman, D.
Soriano, D.
How Artificial Intelligence Can Improve Our Understanding of the Genes Associated with Endometriosis: Natural Language Processing of the PubMed Database
title How Artificial Intelligence Can Improve Our Understanding of the Genes Associated with Endometriosis: Natural Language Processing of the PubMed Database
title_full How Artificial Intelligence Can Improve Our Understanding of the Genes Associated with Endometriosis: Natural Language Processing of the PubMed Database
title_fullStr How Artificial Intelligence Can Improve Our Understanding of the Genes Associated with Endometriosis: Natural Language Processing of the PubMed Database
title_full_unstemmed How Artificial Intelligence Can Improve Our Understanding of the Genes Associated with Endometriosis: Natural Language Processing of the PubMed Database
title_short How Artificial Intelligence Can Improve Our Understanding of the Genes Associated with Endometriosis: Natural Language Processing of the PubMed Database
title_sort how artificial intelligence can improve our understanding of the genes associated with endometriosis: natural language processing of the pubmed database
topic Review Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5884286/
https://www.ncbi.nlm.nih.gov/pubmed/29750165
http://dx.doi.org/10.1155/2018/6217812
work_keys_str_mv AT bouazizj howartificialintelligencecanimproveourunderstandingofthegenesassociatedwithendometriosisnaturallanguageprocessingofthepubmeddatabase
AT mashiachr howartificialintelligencecanimproveourunderstandingofthegenesassociatedwithendometriosisnaturallanguageprocessingofthepubmeddatabase
AT cohens howartificialintelligencecanimproveourunderstandingofthegenesassociatedwithendometriosisnaturallanguageprocessingofthepubmeddatabase
AT kedema howartificialintelligencecanimproveourunderstandingofthegenesassociatedwithendometriosisnaturallanguageprocessingofthepubmeddatabase
AT barona howartificialintelligencecanimproveourunderstandingofthegenesassociatedwithendometriosisnaturallanguageprocessingofthepubmeddatabase
AT zajicekm howartificialintelligencecanimproveourunderstandingofthegenesassociatedwithendometriosisnaturallanguageprocessingofthepubmeddatabase
AT feldmani howartificialintelligencecanimproveourunderstandingofthegenesassociatedwithendometriosisnaturallanguageprocessingofthepubmeddatabase
AT seidmand howartificialintelligencecanimproveourunderstandingofthegenesassociatedwithendometriosisnaturallanguageprocessingofthepubmeddatabase
AT sorianod howartificialintelligencecanimproveourunderstandingofthegenesassociatedwithendometriosisnaturallanguageprocessingofthepubmeddatabase