Cargando…

Browser-based Data Annotation, Active Learning, and Real-Time Distribution of Artificial Intelligence Models: From Tumor Tissue Microarrays to COVID-19 Radiology

BACKGROUND: Artificial intelligence (AI) is fast becoming the tool of choice for scalable and reliable analysis of medical images. However, constraints in sharing medical data outside the institutional or geographical space, as well as difficulties in getting AI models and modeling platforms to work...

Descripción completa

Detalles Bibliográficos
Autores principales: Bhawsar, Praphulla M. S., Abubakar, Mustapha, Schmidt, Marjanka K., Camp, Nicola J., Cessna, Melissa H., Duggan, Máire A., García-Closas, Montserrat, Almeida, Jonas S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Wolters Kluwer - Medknow 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8546359/
https://www.ncbi.nlm.nih.gov/pubmed/34760334
http://dx.doi.org/10.4103/jpi.jpi_100_20
_version_ 1784590179297656832
author Bhawsar, Praphulla M. S.
Abubakar, Mustapha
Schmidt, Marjanka K.
Camp, Nicola J.
Cessna, Melissa H.
Duggan, Máire A.
García-Closas, Montserrat
Almeida, Jonas S.
author_facet Bhawsar, Praphulla M. S.
Abubakar, Mustapha
Schmidt, Marjanka K.
Camp, Nicola J.
Cessna, Melissa H.
Duggan, Máire A.
García-Closas, Montserrat
Almeida, Jonas S.
author_sort Bhawsar, Praphulla M. S.
collection PubMed
description BACKGROUND: Artificial intelligence (AI) is fast becoming the tool of choice for scalable and reliable analysis of medical images. However, constraints in sharing medical data outside the institutional or geographical space, as well as difficulties in getting AI models and modeling platforms to work across different environments, have led to a “reproducibility crisis” in digital medicine. METHODS: This study details the implementation of a web platform that can be used to mitigate these challenges by orchestrating a digital pathology AI pipeline, from raw data to model inference, entirely on the local machine. We discuss how this federated platform provides governed access to data by consuming the Application Program Interfaces exposed by cloud storage services, allows the addition of user-defined annotations, facilitates active learning for training models iteratively, and provides model inference computed directly in the web browser at practically zero cost. The latter is of particular relevance to clinical workflows because the code, including the AI model, travels to the user’s data, which stays private to the governance domain where it was acquired RESULTS: We demonstrate that the web browser can be a means of democratizing AI and advancing data socialization in medical imaging backed by consumer-facing cloud infrastructure such as Box.com. As a case study, we test the accompanying platform end-to-end on a large dataset of digital breast cancer tissue microarray core images. We also showcase how it can be applied in contexts separate from digital pathology by applying it to a radiology dataset containing COVID-19 computed tomography images. CONCLUSIONS: The platform described in this report resolves the challenges to the findable, accessible, interoperable, reusable stewardship of data and AI models by integrating with cloud storage to maintain user-centric governance over the data. It also enables distributed, federated computation for AI inference over those data and proves the viability of client-side AI in medical imaging. AVAILABILITY: The open-source application is publicly available at , with a short video demonstration at .
format Online
Article
Text
id pubmed-8546359
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Wolters Kluwer - Medknow
record_format MEDLINE/PubMed
spelling pubmed-85463592021-11-09 Browser-based Data Annotation, Active Learning, and Real-Time Distribution of Artificial Intelligence Models: From Tumor Tissue Microarrays to COVID-19 Radiology Bhawsar, Praphulla M. S. Abubakar, Mustapha Schmidt, Marjanka K. Camp, Nicola J. Cessna, Melissa H. Duggan, Máire A. García-Closas, Montserrat Almeida, Jonas S. J Pathol Inform Research Article BACKGROUND: Artificial intelligence (AI) is fast becoming the tool of choice for scalable and reliable analysis of medical images. However, constraints in sharing medical data outside the institutional or geographical space, as well as difficulties in getting AI models and modeling platforms to work across different environments, have led to a “reproducibility crisis” in digital medicine. METHODS: This study details the implementation of a web platform that can be used to mitigate these challenges by orchestrating a digital pathology AI pipeline, from raw data to model inference, entirely on the local machine. We discuss how this federated platform provides governed access to data by consuming the Application Program Interfaces exposed by cloud storage services, allows the addition of user-defined annotations, facilitates active learning for training models iteratively, and provides model inference computed directly in the web browser at practically zero cost. The latter is of particular relevance to clinical workflows because the code, including the AI model, travels to the user’s data, which stays private to the governance domain where it was acquired RESULTS: We demonstrate that the web browser can be a means of democratizing AI and advancing data socialization in medical imaging backed by consumer-facing cloud infrastructure such as Box.com. As a case study, we test the accompanying platform end-to-end on a large dataset of digital breast cancer tissue microarray core images. We also showcase how it can be applied in contexts separate from digital pathology by applying it to a radiology dataset containing COVID-19 computed tomography images. CONCLUSIONS: The platform described in this report resolves the challenges to the findable, accessible, interoperable, reusable stewardship of data and AI models by integrating with cloud storage to maintain user-centric governance over the data. It also enables distributed, federated computation for AI inference over those data and proves the viability of client-side AI in medical imaging. AVAILABILITY: The open-source application is publicly available at , with a short video demonstration at . Wolters Kluwer - Medknow 2021-09-27 /pmc/articles/PMC8546359/ /pubmed/34760334 http://dx.doi.org/10.4103/jpi.jpi_100_20 Text en Copyright: © 2021 Journal of Pathology Informatics https://creativecommons.org/licenses/by-nc-sa/4.0/This is an open access journal, and articles are distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 License, which allows others to remix, tweak, and build upon the work non-commercially, as long as appropriate credit is given and the new creations are licensed under the identical terms.
spellingShingle Research Article
Bhawsar, Praphulla M. S.
Abubakar, Mustapha
Schmidt, Marjanka K.
Camp, Nicola J.
Cessna, Melissa H.
Duggan, Máire A.
García-Closas, Montserrat
Almeida, Jonas S.
Browser-based Data Annotation, Active Learning, and Real-Time Distribution of Artificial Intelligence Models: From Tumor Tissue Microarrays to COVID-19 Radiology
title Browser-based Data Annotation, Active Learning, and Real-Time Distribution of Artificial Intelligence Models: From Tumor Tissue Microarrays to COVID-19 Radiology
title_full Browser-based Data Annotation, Active Learning, and Real-Time Distribution of Artificial Intelligence Models: From Tumor Tissue Microarrays to COVID-19 Radiology
title_fullStr Browser-based Data Annotation, Active Learning, and Real-Time Distribution of Artificial Intelligence Models: From Tumor Tissue Microarrays to COVID-19 Radiology
title_full_unstemmed Browser-based Data Annotation, Active Learning, and Real-Time Distribution of Artificial Intelligence Models: From Tumor Tissue Microarrays to COVID-19 Radiology
title_short Browser-based Data Annotation, Active Learning, and Real-Time Distribution of Artificial Intelligence Models: From Tumor Tissue Microarrays to COVID-19 Radiology
title_sort browser-based data annotation, active learning, and real-time distribution of artificial intelligence models: from tumor tissue microarrays to covid-19 radiology
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8546359/
https://www.ncbi.nlm.nih.gov/pubmed/34760334
http://dx.doi.org/10.4103/jpi.jpi_100_20
work_keys_str_mv AT bhawsarpraphullams browserbaseddataannotationactivelearningandrealtimedistributionofartificialintelligencemodelsfromtumortissuemicroarraystocovid19radiology
AT abubakarmustapha browserbaseddataannotationactivelearningandrealtimedistributionofartificialintelligencemodelsfromtumortissuemicroarraystocovid19radiology
AT schmidtmarjankak browserbaseddataannotationactivelearningandrealtimedistributionofartificialintelligencemodelsfromtumortissuemicroarraystocovid19radiology
AT campnicolaj browserbaseddataannotationactivelearningandrealtimedistributionofartificialintelligencemodelsfromtumortissuemicroarraystocovid19radiology
AT cessnamelissah browserbaseddataannotationactivelearningandrealtimedistributionofartificialintelligencemodelsfromtumortissuemicroarraystocovid19radiology
AT dugganmairea browserbaseddataannotationactivelearningandrealtimedistributionofartificialintelligencemodelsfromtumortissuemicroarraystocovid19radiology
AT garciaclosasmontserrat browserbaseddataannotationactivelearningandrealtimedistributionofartificialintelligencemodelsfromtumortissuemicroarraystocovid19radiology
AT almeidajonass browserbaseddataannotationactivelearningandrealtimedistributionofartificialintelligencemodelsfromtumortissuemicroarraystocovid19radiology