Cargando…
Pathogenic variation types in human genes relate to diseases through Pfam and InterPro mapping
Grouping residue variations in a protein according to their physicochemical properties allows a dimensionality reduction of all the possible substitutions in a variant with respect to the wild type. Here, by using a large dataset of proteins with disease-related and benign variations, as derived by...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9523224/ https://www.ncbi.nlm.nih.gov/pubmed/36188216 http://dx.doi.org/10.3389/fmolb.2022.966927 |
_version_ | 1784800226806071296 |
---|---|
author | Babbi, Giulia Savojardo, Castrense Baldazzi, Davide Martelli, Pier Luigi Casadio, Rita |
author_facet | Babbi, Giulia Savojardo, Castrense Baldazzi, Davide Martelli, Pier Luigi Casadio, Rita |
author_sort | Babbi, Giulia |
collection | PubMed |
description | Grouping residue variations in a protein according to their physicochemical properties allows a dimensionality reduction of all the possible substitutions in a variant with respect to the wild type. Here, by using a large dataset of proteins with disease-related and benign variations, as derived by merging Humsavar and ClinVar data, we investigate to which extent our physicochemical grouping procedure can help in determining whether patterns of variation types are related to specific groups of diseases and whether they occur in Pfam and/or InterPro gene domains. Here, we download 75,145 germline disease-related and benign variations of 3,605 genes, group them according to physicochemical categories and map them into Pfam and InterPro gene domains. Statistically validated analysis indicates that each cluster of genes associated to Mondo anatomical system categorizations is characterized by a specific variation pattern. Patterns identify specific Pfam and InterPro domain–Mondo category associations. Our data suggest that the association of variation patterns to Mondo categories is unique and may help in associating gene variants to genetic diseases. This work corroborates in a much larger data set previous observations from our group. |
format | Online Article Text |
id | pubmed-9523224 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-95232242022-10-01 Pathogenic variation types in human genes relate to diseases through Pfam and InterPro mapping Babbi, Giulia Savojardo, Castrense Baldazzi, Davide Martelli, Pier Luigi Casadio, Rita Front Mol Biosci Molecular Biosciences Grouping residue variations in a protein according to their physicochemical properties allows a dimensionality reduction of all the possible substitutions in a variant with respect to the wild type. Here, by using a large dataset of proteins with disease-related and benign variations, as derived by merging Humsavar and ClinVar data, we investigate to which extent our physicochemical grouping procedure can help in determining whether patterns of variation types are related to specific groups of diseases and whether they occur in Pfam and/or InterPro gene domains. Here, we download 75,145 germline disease-related and benign variations of 3,605 genes, group them according to physicochemical categories and map them into Pfam and InterPro gene domains. Statistically validated analysis indicates that each cluster of genes associated to Mondo anatomical system categorizations is characterized by a specific variation pattern. Patterns identify specific Pfam and InterPro domain–Mondo category associations. Our data suggest that the association of variation patterns to Mondo categories is unique and may help in associating gene variants to genetic diseases. This work corroborates in a much larger data set previous observations from our group. Frontiers Media S.A. 2022-09-16 /pmc/articles/PMC9523224/ /pubmed/36188216 http://dx.doi.org/10.3389/fmolb.2022.966927 Text en Copyright © 2022 Babbi, Savojardo, Baldazzi, Martelli and Casadio. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Molecular Biosciences Babbi, Giulia Savojardo, Castrense Baldazzi, Davide Martelli, Pier Luigi Casadio, Rita Pathogenic variation types in human genes relate to diseases through Pfam and InterPro mapping |
title | Pathogenic variation types in human genes relate to diseases through Pfam and InterPro mapping |
title_full | Pathogenic variation types in human genes relate to diseases through Pfam and InterPro mapping |
title_fullStr | Pathogenic variation types in human genes relate to diseases through Pfam and InterPro mapping |
title_full_unstemmed | Pathogenic variation types in human genes relate to diseases through Pfam and InterPro mapping |
title_short | Pathogenic variation types in human genes relate to diseases through Pfam and InterPro mapping |
title_sort | pathogenic variation types in human genes relate to diseases through pfam and interpro mapping |
topic | Molecular Biosciences |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9523224/ https://www.ncbi.nlm.nih.gov/pubmed/36188216 http://dx.doi.org/10.3389/fmolb.2022.966927 |
work_keys_str_mv | AT babbigiulia pathogenicvariationtypesinhumangenesrelatetodiseasesthroughpfamandinterpromapping AT savojardocastrense pathogenicvariationtypesinhumangenesrelatetodiseasesthroughpfamandinterpromapping AT baldazzidavide pathogenicvariationtypesinhumangenesrelatetodiseasesthroughpfamandinterpromapping AT martellipierluigi pathogenicvariationtypesinhumangenesrelatetodiseasesthroughpfamandinterpromapping AT casadiorita pathogenicvariationtypesinhumangenesrelatetodiseasesthroughpfamandinterpromapping |