Cargando…

Development and validation of a case-finding algorithm for the identification of non-small cell lung cancers in a region-wide Italian pathology registry

PURPOSE: To develop and validate a case-finding algorithm for the identification of Non-Small Cell Lung Cancer (NSCLC) cases in a region-wide Italian pathology registry (PR). MATERIALS AND METHODS: Data collected between 2009 and 2017 in the PR and the Pharmacy Database of the University Hospital of...

Descripción completa

Detalles Bibliográficos
Autores principales: Spini, Andrea, Rosellini, Pietro, Bellan, Cristiana, Furiesi, Folco, Giorgi, Silvano, Donnini, Sandra, Gini, Rosa, Ziche, Marina, Salvo, Francesco, Roberto, Giuseppe
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9176782/
https://www.ncbi.nlm.nih.gov/pubmed/35675338
http://dx.doi.org/10.1371/journal.pone.0269232
_version_ 1784722743945592832
author Spini, Andrea
Rosellini, Pietro
Bellan, Cristiana
Furiesi, Folco
Giorgi, Silvano
Donnini, Sandra
Gini, Rosa
Ziche, Marina
Salvo, Francesco
Roberto, Giuseppe
author_facet Spini, Andrea
Rosellini, Pietro
Bellan, Cristiana
Furiesi, Folco
Giorgi, Silvano
Donnini, Sandra
Gini, Rosa
Ziche, Marina
Salvo, Francesco
Roberto, Giuseppe
author_sort Spini, Andrea
collection PubMed
description PURPOSE: To develop and validate a case-finding algorithm for the identification of Non-Small Cell Lung Cancer (NSCLC) cases in a region-wide Italian pathology registry (PR). MATERIALS AND METHODS: Data collected between 2009 and 2017 in the PR and the Pharmacy Database of the University Hospital of Siena and the PR of Tuscany region were used. A NSCLC-identification algorithm based on free-text keywords and SNOMED morphology and topography codes was designed and tested on data from Siena: indication for drug use (i.e. NSCLC) was the reference standard for sensitivity (SE); positive predictive value (PPV) was estimated through manual review. Algorithm modifications were then tested to improve algorithm performance: PPV was calculated against validated dataset from PR of Siena; a range of SE [min-max] was estimated in PR of Tuscany using analytical formulae that assumed NSCLC incidence equal either to 80% or 90% of overall lung cancer incidence recorded in Tuscany. The algorithm modification with the best performance was chosen as the final version of the algorithm. A random sample of 200 cases was extracted from the PR of Tuscany for manual review. RESULTS: The first version of the algorithm showed a PPV of 74.7% and SE of 79% in PR of Siena. The final version of the algorithm had a SE in PR of Tuscany that grew with calendar time (2009 = [24.7%-28%]; 2017 = [57.9%-65.1%]) and a PPV of 93%. CONCLUSIONS: The final NSCLC-finding algorithm showed with very high PPV. SE was in line with the expected contribution of PR to overall cases captured in the regional Cancer Registry, with a trend of increase over calendar time. Given the promising algorithm validity and the wide use of SNOMED terminology in electronic pathology records, the proposed algorithm is expected to be easily adapted to other electronic databases for (pharmaco)epidemiology purposes.
format Online
Article
Text
id pubmed-9176782
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-91767822022-06-09 Development and validation of a case-finding algorithm for the identification of non-small cell lung cancers in a region-wide Italian pathology registry Spini, Andrea Rosellini, Pietro Bellan, Cristiana Furiesi, Folco Giorgi, Silvano Donnini, Sandra Gini, Rosa Ziche, Marina Salvo, Francesco Roberto, Giuseppe PLoS One Research Article PURPOSE: To develop and validate a case-finding algorithm for the identification of Non-Small Cell Lung Cancer (NSCLC) cases in a region-wide Italian pathology registry (PR). MATERIALS AND METHODS: Data collected between 2009 and 2017 in the PR and the Pharmacy Database of the University Hospital of Siena and the PR of Tuscany region were used. A NSCLC-identification algorithm based on free-text keywords and SNOMED morphology and topography codes was designed and tested on data from Siena: indication for drug use (i.e. NSCLC) was the reference standard for sensitivity (SE); positive predictive value (PPV) was estimated through manual review. Algorithm modifications were then tested to improve algorithm performance: PPV was calculated against validated dataset from PR of Siena; a range of SE [min-max] was estimated in PR of Tuscany using analytical formulae that assumed NSCLC incidence equal either to 80% or 90% of overall lung cancer incidence recorded in Tuscany. The algorithm modification with the best performance was chosen as the final version of the algorithm. A random sample of 200 cases was extracted from the PR of Tuscany for manual review. RESULTS: The first version of the algorithm showed a PPV of 74.7% and SE of 79% in PR of Siena. The final version of the algorithm had a SE in PR of Tuscany that grew with calendar time (2009 = [24.7%-28%]; 2017 = [57.9%-65.1%]) and a PPV of 93%. CONCLUSIONS: The final NSCLC-finding algorithm showed with very high PPV. SE was in line with the expected contribution of PR to overall cases captured in the regional Cancer Registry, with a trend of increase over calendar time. Given the promising algorithm validity and the wide use of SNOMED terminology in electronic pathology records, the proposed algorithm is expected to be easily adapted to other electronic databases for (pharmaco)epidemiology purposes. Public Library of Science 2022-06-08 /pmc/articles/PMC9176782/ /pubmed/35675338 http://dx.doi.org/10.1371/journal.pone.0269232 Text en © 2022 Spini et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Spini, Andrea
Rosellini, Pietro
Bellan, Cristiana
Furiesi, Folco
Giorgi, Silvano
Donnini, Sandra
Gini, Rosa
Ziche, Marina
Salvo, Francesco
Roberto, Giuseppe
Development and validation of a case-finding algorithm for the identification of non-small cell lung cancers in a region-wide Italian pathology registry
title Development and validation of a case-finding algorithm for the identification of non-small cell lung cancers in a region-wide Italian pathology registry
title_full Development and validation of a case-finding algorithm for the identification of non-small cell lung cancers in a region-wide Italian pathology registry
title_fullStr Development and validation of a case-finding algorithm for the identification of non-small cell lung cancers in a region-wide Italian pathology registry
title_full_unstemmed Development and validation of a case-finding algorithm for the identification of non-small cell lung cancers in a region-wide Italian pathology registry
title_short Development and validation of a case-finding algorithm for the identification of non-small cell lung cancers in a region-wide Italian pathology registry
title_sort development and validation of a case-finding algorithm for the identification of non-small cell lung cancers in a region-wide italian pathology registry
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9176782/
https://www.ncbi.nlm.nih.gov/pubmed/35675338
http://dx.doi.org/10.1371/journal.pone.0269232
work_keys_str_mv AT spiniandrea developmentandvalidationofacasefindingalgorithmfortheidentificationofnonsmallcelllungcancersinaregionwideitalianpathologyregistry
AT rosellinipietro developmentandvalidationofacasefindingalgorithmfortheidentificationofnonsmallcelllungcancersinaregionwideitalianpathologyregistry
AT bellancristiana developmentandvalidationofacasefindingalgorithmfortheidentificationofnonsmallcelllungcancersinaregionwideitalianpathologyregistry
AT furiesifolco developmentandvalidationofacasefindingalgorithmfortheidentificationofnonsmallcelllungcancersinaregionwideitalianpathologyregistry
AT giorgisilvano developmentandvalidationofacasefindingalgorithmfortheidentificationofnonsmallcelllungcancersinaregionwideitalianpathologyregistry
AT donninisandra developmentandvalidationofacasefindingalgorithmfortheidentificationofnonsmallcelllungcancersinaregionwideitalianpathologyregistry
AT ginirosa developmentandvalidationofacasefindingalgorithmfortheidentificationofnonsmallcelllungcancersinaregionwideitalianpathologyregistry
AT zichemarina developmentandvalidationofacasefindingalgorithmfortheidentificationofnonsmallcelllungcancersinaregionwideitalianpathologyregistry
AT salvofrancesco developmentandvalidationofacasefindingalgorithmfortheidentificationofnonsmallcelllungcancersinaregionwideitalianpathologyregistry
AT robertogiuseppe developmentandvalidationofacasefindingalgorithmfortheidentificationofnonsmallcelllungcancersinaregionwideitalianpathologyregistry