Cargando…

Unsupervised machine learning algorithms identify expected haemorrhage relationships but define unexplained coagulation profiles mapping to thrombotic phenotypes in hereditary haemorrhagic telangiectasia

Hereditary haemorrhagic telangiectasia (HHT) can result in challenging anaemia and thrombosis phenotypes. Clinical presentations of HHT vary for relatives with identical casual mutations, suggesting other factors may modify severity. To examine objectively, we developed unsupervised machine learning...

Descripción completa

Detalles Bibliográficos
Autores principales: Mukhtar, Ghazel, Shovlin, Claire L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10435691/
https://www.ncbi.nlm.nih.gov/pubmed/37601877
http://dx.doi.org/10.1002/jha2.746
_version_ 1785092158965940224
author Mukhtar, Ghazel
Shovlin, Claire L.
author_facet Mukhtar, Ghazel
Shovlin, Claire L.
author_sort Mukhtar, Ghazel
collection PubMed
description Hereditary haemorrhagic telangiectasia (HHT) can result in challenging anaemia and thrombosis phenotypes. Clinical presentations of HHT vary for relatives with identical casual mutations, suggesting other factors may modify severity. To examine objectively, we developed unsupervised machine learning algorithms to test whether haematological data at presentation could be categorised into sub‐groupings and fitted to known biological factors. With ethical approval, we examined 10 complete blood count (CBC) variables, four iron index variables, four coagulation variables and eight iron/coagulation indices combined from 336 genotyped HHT patients (40% male, 60% female, 86.5% not using iron supplementation) at a single centre. T‐SNE unsupervised, dimension reduction, machine learning algorithms assigned each high‐dimensional datapoint to a location in a two‐dimensional plane. k‐Means clustering algorithms grouped into profiles, enabling visualisation and inter‐profile comparisons of patients’ clinical and genetic features. The unsupervised machine learning algorithms using t‐SNE and k‐Means identified two distinct CBC profiles, two iron profiles, four clotting profiles and three combined profiles. Validating the methodology, profiles for CBC or iron indices fitted expected patterns for haemorrhage. Distinct coagulation profiles displayed no association with age, sex, C‐reactive protein, pulmonary arteriovenous malformations (AVMs), ENG/ACVRL1 genotype or epistaxis severity. The most distinct profiles were from t‐SNE/k‐Means analyses of combined iron‐coagulation indices and mapped to three risk states – for venous thromboembolism in HHT; for ischaemic stroke attributed to paradoxical emboli through pulmonary AVMs in HHT; and for cerebral abscess attributed to odontogenic bacteremias in immunocompetent HHT patients with right‐to‐left shunting through pulmonary AVMs. In conclusion, unsupervised machine learning algorithms categorise HHT haematological indices into distinct, clinically relevant profiles which are independent of age, sex or HHT genotype. Further evaluation may inform prophylaxis and management for HHT patients’ haemorrhagic and thrombotic phenotypes.
format Online
Article
Text
id pubmed-10435691
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-104356912023-08-19 Unsupervised machine learning algorithms identify expected haemorrhage relationships but define unexplained coagulation profiles mapping to thrombotic phenotypes in hereditary haemorrhagic telangiectasia Mukhtar, Ghazel Shovlin, Claire L. EJHaem Sickle Cell, Thrombosis, and Classical Haematology Hereditary haemorrhagic telangiectasia (HHT) can result in challenging anaemia and thrombosis phenotypes. Clinical presentations of HHT vary for relatives with identical casual mutations, suggesting other factors may modify severity. To examine objectively, we developed unsupervised machine learning algorithms to test whether haematological data at presentation could be categorised into sub‐groupings and fitted to known biological factors. With ethical approval, we examined 10 complete blood count (CBC) variables, four iron index variables, four coagulation variables and eight iron/coagulation indices combined from 336 genotyped HHT patients (40% male, 60% female, 86.5% not using iron supplementation) at a single centre. T‐SNE unsupervised, dimension reduction, machine learning algorithms assigned each high‐dimensional datapoint to a location in a two‐dimensional plane. k‐Means clustering algorithms grouped into profiles, enabling visualisation and inter‐profile comparisons of patients’ clinical and genetic features. The unsupervised machine learning algorithms using t‐SNE and k‐Means identified two distinct CBC profiles, two iron profiles, four clotting profiles and three combined profiles. Validating the methodology, profiles for CBC or iron indices fitted expected patterns for haemorrhage. Distinct coagulation profiles displayed no association with age, sex, C‐reactive protein, pulmonary arteriovenous malformations (AVMs), ENG/ACVRL1 genotype or epistaxis severity. The most distinct profiles were from t‐SNE/k‐Means analyses of combined iron‐coagulation indices and mapped to three risk states – for venous thromboembolism in HHT; for ischaemic stroke attributed to paradoxical emboli through pulmonary AVMs in HHT; and for cerebral abscess attributed to odontogenic bacteremias in immunocompetent HHT patients with right‐to‐left shunting through pulmonary AVMs. In conclusion, unsupervised machine learning algorithms categorise HHT haematological indices into distinct, clinically relevant profiles which are independent of age, sex or HHT genotype. Further evaluation may inform prophylaxis and management for HHT patients’ haemorrhagic and thrombotic phenotypes. John Wiley and Sons Inc. 2023-07-03 /pmc/articles/PMC10435691/ /pubmed/37601877 http://dx.doi.org/10.1002/jha2.746 Text en © 2023 The Authors. eJHaem published by British Society for Haematology and John Wiley & Sons Ltd. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Sickle Cell, Thrombosis, and Classical Haematology
Mukhtar, Ghazel
Shovlin, Claire L.
Unsupervised machine learning algorithms identify expected haemorrhage relationships but define unexplained coagulation profiles mapping to thrombotic phenotypes in hereditary haemorrhagic telangiectasia
title Unsupervised machine learning algorithms identify expected haemorrhage relationships but define unexplained coagulation profiles mapping to thrombotic phenotypes in hereditary haemorrhagic telangiectasia
title_full Unsupervised machine learning algorithms identify expected haemorrhage relationships but define unexplained coagulation profiles mapping to thrombotic phenotypes in hereditary haemorrhagic telangiectasia
title_fullStr Unsupervised machine learning algorithms identify expected haemorrhage relationships but define unexplained coagulation profiles mapping to thrombotic phenotypes in hereditary haemorrhagic telangiectasia
title_full_unstemmed Unsupervised machine learning algorithms identify expected haemorrhage relationships but define unexplained coagulation profiles mapping to thrombotic phenotypes in hereditary haemorrhagic telangiectasia
title_short Unsupervised machine learning algorithms identify expected haemorrhage relationships but define unexplained coagulation profiles mapping to thrombotic phenotypes in hereditary haemorrhagic telangiectasia
title_sort unsupervised machine learning algorithms identify expected haemorrhage relationships but define unexplained coagulation profiles mapping to thrombotic phenotypes in hereditary haemorrhagic telangiectasia
topic Sickle Cell, Thrombosis, and Classical Haematology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10435691/
https://www.ncbi.nlm.nih.gov/pubmed/37601877
http://dx.doi.org/10.1002/jha2.746
work_keys_str_mv AT mukhtarghazel unsupervisedmachinelearningalgorithmsidentifyexpectedhaemorrhagerelationshipsbutdefineunexplainedcoagulationprofilesmappingtothromboticphenotypesinhereditaryhaemorrhagictelangiectasia
AT shovlinclairel unsupervisedmachinelearningalgorithmsidentifyexpectedhaemorrhagerelationshipsbutdefineunexplainedcoagulationprofilesmappingtothromboticphenotypesinhereditaryhaemorrhagictelangiectasia