Cargando…

Pathogenic variation types in human genes relate to diseases through Pfam and InterPro mapping

Grouping residue variations in a protein according to their physicochemical properties allows a dimensionality reduction of all the possible substitutions in a variant with respect to the wild type. Here, by using a large dataset of proteins with disease-related and benign variations, as derived by...

Descripción completa

Detalles Bibliográficos
Autores principales: Babbi, Giulia, Savojardo, Castrense, Baldazzi, Davide, Martelli, Pier Luigi, Casadio, Rita
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9523224/
https://www.ncbi.nlm.nih.gov/pubmed/36188216
http://dx.doi.org/10.3389/fmolb.2022.966927
_version_ 1784800226806071296
author Babbi, Giulia
Savojardo, Castrense
Baldazzi, Davide
Martelli, Pier Luigi
Casadio, Rita
author_facet Babbi, Giulia
Savojardo, Castrense
Baldazzi, Davide
Martelli, Pier Luigi
Casadio, Rita
author_sort Babbi, Giulia
collection PubMed
description Grouping residue variations in a protein according to their physicochemical properties allows a dimensionality reduction of all the possible substitutions in a variant with respect to the wild type. Here, by using a large dataset of proteins with disease-related and benign variations, as derived by merging Humsavar and ClinVar data, we investigate to which extent our physicochemical grouping procedure can help in determining whether patterns of variation types are related to specific groups of diseases and whether they occur in Pfam and/or InterPro gene domains. Here, we download 75,145 germline disease-related and benign variations of 3,605 genes, group them according to physicochemical categories and map them into Pfam and InterPro gene domains. Statistically validated analysis indicates that each cluster of genes associated to Mondo anatomical system categorizations is characterized by a specific variation pattern. Patterns identify specific Pfam and InterPro domain–Mondo category associations. Our data suggest that the association of variation patterns to Mondo categories is unique and may help in associating gene variants to genetic diseases. This work corroborates in a much larger data set previous observations from our group.
format Online
Article
Text
id pubmed-9523224
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-95232242022-10-01 Pathogenic variation types in human genes relate to diseases through Pfam and InterPro mapping Babbi, Giulia Savojardo, Castrense Baldazzi, Davide Martelli, Pier Luigi Casadio, Rita Front Mol Biosci Molecular Biosciences Grouping residue variations in a protein according to their physicochemical properties allows a dimensionality reduction of all the possible substitutions in a variant with respect to the wild type. Here, by using a large dataset of proteins with disease-related and benign variations, as derived by merging Humsavar and ClinVar data, we investigate to which extent our physicochemical grouping procedure can help in determining whether patterns of variation types are related to specific groups of diseases and whether they occur in Pfam and/or InterPro gene domains. Here, we download 75,145 germline disease-related and benign variations of 3,605 genes, group them according to physicochemical categories and map them into Pfam and InterPro gene domains. Statistically validated analysis indicates that each cluster of genes associated to Mondo anatomical system categorizations is characterized by a specific variation pattern. Patterns identify specific Pfam and InterPro domain–Mondo category associations. Our data suggest that the association of variation patterns to Mondo categories is unique and may help in associating gene variants to genetic diseases. This work corroborates in a much larger data set previous observations from our group. Frontiers Media S.A. 2022-09-16 /pmc/articles/PMC9523224/ /pubmed/36188216 http://dx.doi.org/10.3389/fmolb.2022.966927 Text en Copyright © 2022 Babbi, Savojardo, Baldazzi, Martelli and Casadio. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Molecular Biosciences
Babbi, Giulia
Savojardo, Castrense
Baldazzi, Davide
Martelli, Pier Luigi
Casadio, Rita
Pathogenic variation types in human genes relate to diseases through Pfam and InterPro mapping
title Pathogenic variation types in human genes relate to diseases through Pfam and InterPro mapping
title_full Pathogenic variation types in human genes relate to diseases through Pfam and InterPro mapping
title_fullStr Pathogenic variation types in human genes relate to diseases through Pfam and InterPro mapping
title_full_unstemmed Pathogenic variation types in human genes relate to diseases through Pfam and InterPro mapping
title_short Pathogenic variation types in human genes relate to diseases through Pfam and InterPro mapping
title_sort pathogenic variation types in human genes relate to diseases through pfam and interpro mapping
topic Molecular Biosciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9523224/
https://www.ncbi.nlm.nih.gov/pubmed/36188216
http://dx.doi.org/10.3389/fmolb.2022.966927
work_keys_str_mv AT babbigiulia pathogenicvariationtypesinhumangenesrelatetodiseasesthroughpfamandinterpromapping
AT savojardocastrense pathogenicvariationtypesinhumangenesrelatetodiseasesthroughpfamandinterpromapping
AT baldazzidavide pathogenicvariationtypesinhumangenesrelatetodiseasesthroughpfamandinterpromapping
AT martellipierluigi pathogenicvariationtypesinhumangenesrelatetodiseasesthroughpfamandinterpromapping
AT casadiorita pathogenicvariationtypesinhumangenesrelatetodiseasesthroughpfamandinterpromapping