Cargando…

Reducing False-Positives in Lung Nodules Detection Using Balanced Datasets

Malignant pulmonary nodules are one of the main manifestations of lung cancer in early CT image screening. Since lung cancer may have no early obvious symptoms, it is important to develop a computer-aided detection (CAD) system to assist doctors to detect the malignant pulmonary nodules in the early...

Descripción completa

Detalles Bibliográficos
Autores principales: Liang, Jinglun, Ye, Guoliang, Guo, Jianwen, Huang, Qifan, Zhang, Shaohui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8170487/
https://www.ncbi.nlm.nih.gov/pubmed/34095073
http://dx.doi.org/10.3389/fpubh.2021.671070
_version_ 1783702257304338432
author Liang, Jinglun
Ye, Guoliang
Guo, Jianwen
Huang, Qifan
Zhang, Shaohui
author_facet Liang, Jinglun
Ye, Guoliang
Guo, Jianwen
Huang, Qifan
Zhang, Shaohui
author_sort Liang, Jinglun
collection PubMed
description Malignant pulmonary nodules are one of the main manifestations of lung cancer in early CT image screening. Since lung cancer may have no early obvious symptoms, it is important to develop a computer-aided detection (CAD) system to assist doctors to detect the malignant pulmonary nodules in the early stage of lung cancer CT diagnosis. Due to the recent successful applications of deep learning in image processing, more and more researchers have been trying to apply it to the diagnosis of pulmonary nodules. However, due to the ratio of nodules and non-nodules samples used in the training and testing datasets usually being different from the practical ratio of lung cancer, the CAD classification systems may easily produce higher false-positives while using this imbalanced dataset. This work introduces a filtering step to remove the irrelevant images from the dataset, and the results show that the false-positives can be reduced and the accuracy can be above 98%. There are two steps in nodule detection. Firstly, the images with pulmonary nodules are screened from the whole lung CT images of the patients. Secondly, the exact locations of pulmonary nodules will be detected using Faster R-CNN. Final results show that this method can effectively detect the pulmonary nodules in the CT images and hence potentially assist doctors in the early diagnosis of lung cancer.
format Online
Article
Text
id pubmed-8170487
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-81704872021-06-03 Reducing False-Positives in Lung Nodules Detection Using Balanced Datasets Liang, Jinglun Ye, Guoliang Guo, Jianwen Huang, Qifan Zhang, Shaohui Front Public Health Public Health Malignant pulmonary nodules are one of the main manifestations of lung cancer in early CT image screening. Since lung cancer may have no early obvious symptoms, it is important to develop a computer-aided detection (CAD) system to assist doctors to detect the malignant pulmonary nodules in the early stage of lung cancer CT diagnosis. Due to the recent successful applications of deep learning in image processing, more and more researchers have been trying to apply it to the diagnosis of pulmonary nodules. However, due to the ratio of nodules and non-nodules samples used in the training and testing datasets usually being different from the practical ratio of lung cancer, the CAD classification systems may easily produce higher false-positives while using this imbalanced dataset. This work introduces a filtering step to remove the irrelevant images from the dataset, and the results show that the false-positives can be reduced and the accuracy can be above 98%. There are two steps in nodule detection. Firstly, the images with pulmonary nodules are screened from the whole lung CT images of the patients. Secondly, the exact locations of pulmonary nodules will be detected using Faster R-CNN. Final results show that this method can effectively detect the pulmonary nodules in the CT images and hence potentially assist doctors in the early diagnosis of lung cancer. Frontiers Media S.A. 2021-05-19 /pmc/articles/PMC8170487/ /pubmed/34095073 http://dx.doi.org/10.3389/fpubh.2021.671070 Text en Copyright © 2021 Liang, Ye, Guo, Huang and Zhang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Public Health
Liang, Jinglun
Ye, Guoliang
Guo, Jianwen
Huang, Qifan
Zhang, Shaohui
Reducing False-Positives in Lung Nodules Detection Using Balanced Datasets
title Reducing False-Positives in Lung Nodules Detection Using Balanced Datasets
title_full Reducing False-Positives in Lung Nodules Detection Using Balanced Datasets
title_fullStr Reducing False-Positives in Lung Nodules Detection Using Balanced Datasets
title_full_unstemmed Reducing False-Positives in Lung Nodules Detection Using Balanced Datasets
title_short Reducing False-Positives in Lung Nodules Detection Using Balanced Datasets
title_sort reducing false-positives in lung nodules detection using balanced datasets
topic Public Health
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8170487/
https://www.ncbi.nlm.nih.gov/pubmed/34095073
http://dx.doi.org/10.3389/fpubh.2021.671070
work_keys_str_mv AT liangjinglun reducingfalsepositivesinlungnodulesdetectionusingbalanceddatasets
AT yeguoliang reducingfalsepositivesinlungnodulesdetectionusingbalanceddatasets
AT guojianwen reducingfalsepositivesinlungnodulesdetectionusingbalanceddatasets
AT huangqifan reducingfalsepositivesinlungnodulesdetectionusingbalanceddatasets
AT zhangshaohui reducingfalsepositivesinlungnodulesdetectionusingbalanceddatasets