Cargando…

Genetic signature of differentiated thyroid carcinoma susceptibility: a machine learning approach

To identify a peculiar genetic combination predisposing to differentiated thyroid carcinoma (DTC), we selected a set of single nucleotide polymorphisms (SNPs) associated with DTC risk, considering polygenic risk score (PRS), Bayesian statistics and a machine learning (ML) classifier to describe case...

Descripción completa

Detalles Bibliográficos
Autores principales: Brigante, Giulia, Lazzaretti, Clara, Paradiso, Elia, Nuzzo, Federico, Sitti, Martina, Tüttelmann, Frank, Moretti, Gabriele, Silvestri, Roberto, Gemignani, Federica, Försti, Asta, Hemminki, Kari, Elisei, Rossella, Romei, Cristina, Zizzi, Eric Adriano, Deriu, Marco Agostino, Simoni, Manuela, Landi, Stefano, Casarini, Livio
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Bioscientifica Ltd 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9513665/
https://www.ncbi.nlm.nih.gov/pubmed/35976137
http://dx.doi.org/10.1530/ETJ-22-0058
_version_ 1784798119632830464
author Brigante, Giulia
Lazzaretti, Clara
Paradiso, Elia
Nuzzo, Federico
Sitti, Martina
Tüttelmann, Frank
Moretti, Gabriele
Silvestri, Roberto
Gemignani, Federica
Försti, Asta
Hemminki, Kari
Elisei, Rossella
Romei, Cristina
Zizzi, Eric Adriano
Deriu, Marco Agostino
Simoni, Manuela
Landi, Stefano
Casarini, Livio
author_facet Brigante, Giulia
Lazzaretti, Clara
Paradiso, Elia
Nuzzo, Federico
Sitti, Martina
Tüttelmann, Frank
Moretti, Gabriele
Silvestri, Roberto
Gemignani, Federica
Försti, Asta
Hemminki, Kari
Elisei, Rossella
Romei, Cristina
Zizzi, Eric Adriano
Deriu, Marco Agostino
Simoni, Manuela
Landi, Stefano
Casarini, Livio
author_sort Brigante, Giulia
collection PubMed
description To identify a peculiar genetic combination predisposing to differentiated thyroid carcinoma (DTC), we selected a set of single nucleotide polymorphisms (SNPs) associated with DTC risk, considering polygenic risk score (PRS), Bayesian statistics and a machine learning (ML) classifier to describe cases and controls in three different datasets. Dataset 1 (649 DTC, 431 controls) has been previously genotyped in a genome-wide association study (GWAS) on Italian DTC. Dataset 2 (234 DTC, 101 controls) and dataset 3 (404 DTC, 392 controls) were genotyped. Associations of 171 SNPs reported to predispose to DTC in candidate studies were extracted from the GWAS of dataset 1, followed by replication of SNPs associated with DTC risk (P < 0.05) in dataset 2. The reliability of the identified SNPs was confirmed by PRS and Bayesian statistics after merging the three datasets. SNPs were used to describe the case/control state of individuals by ML classifier. Starting from 171 SNPs associated with DTC, 15 were positive in both datasets 1 and 2. Using these markers, PRS revealed that individuals in the fifth quintile had a seven-fold increased risk of DTC than those in the first. Bayesian inference confirmed that the selected 15 SNPs differentiate cases from controls. Results were corroborated by ML, finding a maximum AUC of about 0.7. A restricted selection of only 15 DTC-associated SNPs is able to describe the inner genetic structure of Italian individuals, and ML allows a fair prediction of case or control status based solely on the individual genetic background.
format Online
Article
Text
id pubmed-9513665
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Bioscientifica Ltd
record_format MEDLINE/PubMed
spelling pubmed-95136652022-09-28 Genetic signature of differentiated thyroid carcinoma susceptibility: a machine learning approach Brigante, Giulia Lazzaretti, Clara Paradiso, Elia Nuzzo, Federico Sitti, Martina Tüttelmann, Frank Moretti, Gabriele Silvestri, Roberto Gemignani, Federica Försti, Asta Hemminki, Kari Elisei, Rossella Romei, Cristina Zizzi, Eric Adriano Deriu, Marco Agostino Simoni, Manuela Landi, Stefano Casarini, Livio Eur Thyroid J Research To identify a peculiar genetic combination predisposing to differentiated thyroid carcinoma (DTC), we selected a set of single nucleotide polymorphisms (SNPs) associated with DTC risk, considering polygenic risk score (PRS), Bayesian statistics and a machine learning (ML) classifier to describe cases and controls in three different datasets. Dataset 1 (649 DTC, 431 controls) has been previously genotyped in a genome-wide association study (GWAS) on Italian DTC. Dataset 2 (234 DTC, 101 controls) and dataset 3 (404 DTC, 392 controls) were genotyped. Associations of 171 SNPs reported to predispose to DTC in candidate studies were extracted from the GWAS of dataset 1, followed by replication of SNPs associated with DTC risk (P < 0.05) in dataset 2. The reliability of the identified SNPs was confirmed by PRS and Bayesian statistics after merging the three datasets. SNPs were used to describe the case/control state of individuals by ML classifier. Starting from 171 SNPs associated with DTC, 15 were positive in both datasets 1 and 2. Using these markers, PRS revealed that individuals in the fifth quintile had a seven-fold increased risk of DTC than those in the first. Bayesian inference confirmed that the selected 15 SNPs differentiate cases from controls. Results were corroborated by ML, finding a maximum AUC of about 0.7. A restricted selection of only 15 DTC-associated SNPs is able to describe the inner genetic structure of Italian individuals, and ML allows a fair prediction of case or control status based solely on the individual genetic background. Bioscientifica Ltd 2022-08-17 /pmc/articles/PMC9513665/ /pubmed/35976137 http://dx.doi.org/10.1530/ETJ-22-0058 Text en © The authors https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License. (https://creativecommons.org/licenses/by/4.0/)
spellingShingle Research
Brigante, Giulia
Lazzaretti, Clara
Paradiso, Elia
Nuzzo, Federico
Sitti, Martina
Tüttelmann, Frank
Moretti, Gabriele
Silvestri, Roberto
Gemignani, Federica
Försti, Asta
Hemminki, Kari
Elisei, Rossella
Romei, Cristina
Zizzi, Eric Adriano
Deriu, Marco Agostino
Simoni, Manuela
Landi, Stefano
Casarini, Livio
Genetic signature of differentiated thyroid carcinoma susceptibility: a machine learning approach
title Genetic signature of differentiated thyroid carcinoma susceptibility: a machine learning approach
title_full Genetic signature of differentiated thyroid carcinoma susceptibility: a machine learning approach
title_fullStr Genetic signature of differentiated thyroid carcinoma susceptibility: a machine learning approach
title_full_unstemmed Genetic signature of differentiated thyroid carcinoma susceptibility: a machine learning approach
title_short Genetic signature of differentiated thyroid carcinoma susceptibility: a machine learning approach
title_sort genetic signature of differentiated thyroid carcinoma susceptibility: a machine learning approach
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9513665/
https://www.ncbi.nlm.nih.gov/pubmed/35976137
http://dx.doi.org/10.1530/ETJ-22-0058
work_keys_str_mv AT brigantegiulia geneticsignatureofdifferentiatedthyroidcarcinomasusceptibilityamachinelearningapproach
AT lazzaretticlara geneticsignatureofdifferentiatedthyroidcarcinomasusceptibilityamachinelearningapproach
AT paradisoelia geneticsignatureofdifferentiatedthyroidcarcinomasusceptibilityamachinelearningapproach
AT nuzzofederico geneticsignatureofdifferentiatedthyroidcarcinomasusceptibilityamachinelearningapproach
AT sittimartina geneticsignatureofdifferentiatedthyroidcarcinomasusceptibilityamachinelearningapproach
AT tuttelmannfrank geneticsignatureofdifferentiatedthyroidcarcinomasusceptibilityamachinelearningapproach
AT morettigabriele geneticsignatureofdifferentiatedthyroidcarcinomasusceptibilityamachinelearningapproach
AT silvestriroberto geneticsignatureofdifferentiatedthyroidcarcinomasusceptibilityamachinelearningapproach
AT gemignanifederica geneticsignatureofdifferentiatedthyroidcarcinomasusceptibilityamachinelearningapproach
AT forstiasta geneticsignatureofdifferentiatedthyroidcarcinomasusceptibilityamachinelearningapproach
AT hemminkikari geneticsignatureofdifferentiatedthyroidcarcinomasusceptibilityamachinelearningapproach
AT eliseirossella geneticsignatureofdifferentiatedthyroidcarcinomasusceptibilityamachinelearningapproach
AT romeicristina geneticsignatureofdifferentiatedthyroidcarcinomasusceptibilityamachinelearningapproach
AT zizziericadriano geneticsignatureofdifferentiatedthyroidcarcinomasusceptibilityamachinelearningapproach
AT deriumarcoagostino geneticsignatureofdifferentiatedthyroidcarcinomasusceptibilityamachinelearningapproach
AT simonimanuela geneticsignatureofdifferentiatedthyroidcarcinomasusceptibilityamachinelearningapproach
AT landistefano geneticsignatureofdifferentiatedthyroidcarcinomasusceptibilityamachinelearningapproach
AT casarinilivio geneticsignatureofdifferentiatedthyroidcarcinomasusceptibilityamachinelearningapproach