Cargando…
Text mining of gene–phenotype associations reveals new phenotypic profiles of autism-associated genes
Autism is a spectrum disorder with wide variation in type and severity of symptoms. Understanding gene–phenotype associations is vital to unravel the disease mechanisms and advance its diagnosis and treatment. To date, several databases have stored a large portion of gene–phenotype associations whic...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8316556/ https://www.ncbi.nlm.nih.gov/pubmed/34315992 http://dx.doi.org/10.1038/s41598-021-94742-z |
_version_ | 1783729875715096576 |
---|---|
author | Li, Sijie Guo, Ziqi Ioffe, Jacob B. Hu, Yunfei Zhen, Yi Zhou, Xin |
author_facet | Li, Sijie Guo, Ziqi Ioffe, Jacob B. Hu, Yunfei Zhen, Yi Zhou, Xin |
author_sort | Li, Sijie |
collection | PubMed |
description | Autism is a spectrum disorder with wide variation in type and severity of symptoms. Understanding gene–phenotype associations is vital to unravel the disease mechanisms and advance its diagnosis and treatment. To date, several databases have stored a large portion of gene–phenotype associations which are mainly obtained from genetic experiments. However, a large proportion of gene–phenotype associations are still buried in the autism-related literature and there are limited resources to investigate autism-associated gene–phenotype associations. Given the abundance of the autism-related literature, we were thus motivated to develop Autism_genepheno, a text mining pipeline to identify sentence-level mentions of autism-associated genes and phenotypes in literature through natural language processing methods. We have generated a comprehensive database of gene–phenotype associations in the last five years’ autism-related literature that can be easily updated as new literature becomes available. We have evaluated our pipeline through several different approaches, and we are able to rank and select top autism-associated genes through their unique and wide spectrum of phenotypic profiles, which could provide a unique resource for the diagnosis and treatment of autism. The data resources and the Autism_genpheno pipeline are available at: https://github.com/maiziezhoulab/Autism_genepheno. |
format | Online Article Text |
id | pubmed-8316556 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-83165562021-07-29 Text mining of gene–phenotype associations reveals new phenotypic profiles of autism-associated genes Li, Sijie Guo, Ziqi Ioffe, Jacob B. Hu, Yunfei Zhen, Yi Zhou, Xin Sci Rep Article Autism is a spectrum disorder with wide variation in type and severity of symptoms. Understanding gene–phenotype associations is vital to unravel the disease mechanisms and advance its diagnosis and treatment. To date, several databases have stored a large portion of gene–phenotype associations which are mainly obtained from genetic experiments. However, a large proportion of gene–phenotype associations are still buried in the autism-related literature and there are limited resources to investigate autism-associated gene–phenotype associations. Given the abundance of the autism-related literature, we were thus motivated to develop Autism_genepheno, a text mining pipeline to identify sentence-level mentions of autism-associated genes and phenotypes in literature through natural language processing methods. We have generated a comprehensive database of gene–phenotype associations in the last five years’ autism-related literature that can be easily updated as new literature becomes available. We have evaluated our pipeline through several different approaches, and we are able to rank and select top autism-associated genes through their unique and wide spectrum of phenotypic profiles, which could provide a unique resource for the diagnosis and treatment of autism. The data resources and the Autism_genpheno pipeline are available at: https://github.com/maiziezhoulab/Autism_genepheno. Nature Publishing Group UK 2021-07-27 /pmc/articles/PMC8316556/ /pubmed/34315992 http://dx.doi.org/10.1038/s41598-021-94742-z Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Li, Sijie Guo, Ziqi Ioffe, Jacob B. Hu, Yunfei Zhen, Yi Zhou, Xin Text mining of gene–phenotype associations reveals new phenotypic profiles of autism-associated genes |
title | Text mining of gene–phenotype associations reveals new phenotypic profiles of autism-associated genes |
title_full | Text mining of gene–phenotype associations reveals new phenotypic profiles of autism-associated genes |
title_fullStr | Text mining of gene–phenotype associations reveals new phenotypic profiles of autism-associated genes |
title_full_unstemmed | Text mining of gene–phenotype associations reveals new phenotypic profiles of autism-associated genes |
title_short | Text mining of gene–phenotype associations reveals new phenotypic profiles of autism-associated genes |
title_sort | text mining of gene–phenotype associations reveals new phenotypic profiles of autism-associated genes |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8316556/ https://www.ncbi.nlm.nih.gov/pubmed/34315992 http://dx.doi.org/10.1038/s41598-021-94742-z |
work_keys_str_mv | AT lisijie textminingofgenephenotypeassociationsrevealsnewphenotypicprofilesofautismassociatedgenes AT guoziqi textminingofgenephenotypeassociationsrevealsnewphenotypicprofilesofautismassociatedgenes AT ioffejacobb textminingofgenephenotypeassociationsrevealsnewphenotypicprofilesofautismassociatedgenes AT huyunfei textminingofgenephenotypeassociationsrevealsnewphenotypicprofilesofautismassociatedgenes AT zhenyi textminingofgenephenotypeassociationsrevealsnewphenotypicprofilesofautismassociatedgenes AT zhouxin textminingofgenephenotypeassociationsrevealsnewphenotypicprofilesofautismassociatedgenes |