Cargando…

Text mining of gene–phenotype associations reveals new phenotypic profiles of autism-associated genes

Autism is a spectrum disorder with wide variation in type and severity of symptoms. Understanding gene–phenotype associations is vital to unravel the disease mechanisms and advance its diagnosis and treatment. To date, several databases have stored a large portion of gene–phenotype associations whic...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Sijie, Guo, Ziqi, Ioffe, Jacob B., Hu, Yunfei, Zhen, Yi, Zhou, Xin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8316556/
https://www.ncbi.nlm.nih.gov/pubmed/34315992
http://dx.doi.org/10.1038/s41598-021-94742-z
_version_ 1783729875715096576
author Li, Sijie
Guo, Ziqi
Ioffe, Jacob B.
Hu, Yunfei
Zhen, Yi
Zhou, Xin
author_facet Li, Sijie
Guo, Ziqi
Ioffe, Jacob B.
Hu, Yunfei
Zhen, Yi
Zhou, Xin
author_sort Li, Sijie
collection PubMed
description Autism is a spectrum disorder with wide variation in type and severity of symptoms. Understanding gene–phenotype associations is vital to unravel the disease mechanisms and advance its diagnosis and treatment. To date, several databases have stored a large portion of gene–phenotype associations which are mainly obtained from genetic experiments. However, a large proportion of gene–phenotype associations are still buried in the autism-related literature and there are limited resources to investigate autism-associated gene–phenotype associations. Given the abundance of the autism-related literature, we were thus motivated to develop Autism_genepheno, a text mining pipeline to identify sentence-level mentions of autism-associated genes and phenotypes in literature through natural language processing methods. We have generated a comprehensive database of gene–phenotype associations in the last five years’ autism-related literature that can be easily updated as new literature becomes available. We have evaluated our pipeline through several different approaches, and we are able to rank and select top autism-associated genes through their unique and wide spectrum of phenotypic profiles, which could provide a unique resource for the diagnosis and treatment of autism. The data resources and the Autism_genpheno pipeline are available at: https://github.com/maiziezhoulab/Autism_genepheno.
format Online
Article
Text
id pubmed-8316556
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-83165562021-07-29 Text mining of gene–phenotype associations reveals new phenotypic profiles of autism-associated genes Li, Sijie Guo, Ziqi Ioffe, Jacob B. Hu, Yunfei Zhen, Yi Zhou, Xin Sci Rep Article Autism is a spectrum disorder with wide variation in type and severity of symptoms. Understanding gene–phenotype associations is vital to unravel the disease mechanisms and advance its diagnosis and treatment. To date, several databases have stored a large portion of gene–phenotype associations which are mainly obtained from genetic experiments. However, a large proportion of gene–phenotype associations are still buried in the autism-related literature and there are limited resources to investigate autism-associated gene–phenotype associations. Given the abundance of the autism-related literature, we were thus motivated to develop Autism_genepheno, a text mining pipeline to identify sentence-level mentions of autism-associated genes and phenotypes in literature through natural language processing methods. We have generated a comprehensive database of gene–phenotype associations in the last five years’ autism-related literature that can be easily updated as new literature becomes available. We have evaluated our pipeline through several different approaches, and we are able to rank and select top autism-associated genes through their unique and wide spectrum of phenotypic profiles, which could provide a unique resource for the diagnosis and treatment of autism. The data resources and the Autism_genpheno pipeline are available at: https://github.com/maiziezhoulab/Autism_genepheno. Nature Publishing Group UK 2021-07-27 /pmc/articles/PMC8316556/ /pubmed/34315992 http://dx.doi.org/10.1038/s41598-021-94742-z Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Li, Sijie
Guo, Ziqi
Ioffe, Jacob B.
Hu, Yunfei
Zhen, Yi
Zhou, Xin
Text mining of gene–phenotype associations reveals new phenotypic profiles of autism-associated genes
title Text mining of gene–phenotype associations reveals new phenotypic profiles of autism-associated genes
title_full Text mining of gene–phenotype associations reveals new phenotypic profiles of autism-associated genes
title_fullStr Text mining of gene–phenotype associations reveals new phenotypic profiles of autism-associated genes
title_full_unstemmed Text mining of gene–phenotype associations reveals new phenotypic profiles of autism-associated genes
title_short Text mining of gene–phenotype associations reveals new phenotypic profiles of autism-associated genes
title_sort text mining of gene–phenotype associations reveals new phenotypic profiles of autism-associated genes
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8316556/
https://www.ncbi.nlm.nih.gov/pubmed/34315992
http://dx.doi.org/10.1038/s41598-021-94742-z
work_keys_str_mv AT lisijie textminingofgenephenotypeassociationsrevealsnewphenotypicprofilesofautismassociatedgenes
AT guoziqi textminingofgenephenotypeassociationsrevealsnewphenotypicprofilesofautismassociatedgenes
AT ioffejacobb textminingofgenephenotypeassociationsrevealsnewphenotypicprofilesofautismassociatedgenes
AT huyunfei textminingofgenephenotypeassociationsrevealsnewphenotypicprofilesofautismassociatedgenes
AT zhenyi textminingofgenephenotypeassociationsrevealsnewphenotypicprofilesofautismassociatedgenes
AT zhouxin textminingofgenephenotypeassociationsrevealsnewphenotypicprofilesofautismassociatedgenes