Segmentation of lung parenchyma in CT images using CNN trained with the clustering algorithm generated dataset
BACKGROUND: Lung segmentation constitutes a critical procedure for any clinical-decision supporting system aimed to improve the early diagnosis and treatment of lung diseases. Abnormal lungs mainly include lung parenchyma with commonalities on CT images across subjects, diseases and CT scanners, and...
Main authors: | Xu, Mingjie; Qi, Shouliang; Yue, Yong; Teng, Yueyang; Xu, Lisheng; Yao, Yudong; Qian, Wei |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | BioMed Central 2019 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6317251/ https://www.ncbi.nlm.nih.gov/pubmed/30602393 http://dx.doi.org/10.1186/s12938-018-0619-9 |
_version_ | 1783384716978683904 |
---|---|
author | Xu, Mingjie Qi, Shouliang Yue, Yong Teng, Yueyang Xu, Lisheng Yao, Yudong Qian, Wei |
author_facet | Xu, Mingjie Qi, Shouliang Yue, Yong Teng, Yueyang Xu, Lisheng Yao, Yudong Qian, Wei |
author_sort | Xu, Mingjie |
collection | PubMed |
description | BACKGROUND: Lung segmentation constitutes a critical procedure for any clinical-decision supporting system aimed to improve the early diagnosis and treatment of lung diseases. Abnormal lungs mainly include lung parenchyma with commonalities on CT images across subjects, diseases and CT scanners, and lung lesions presenting various appearances. Segmentation of lung parenchyma can help locate and analyze the neighboring lesions, but is not well studied in the framework of machine learning. METHODS: We propose to segment lung parenchyma using a convolutional neural network (CNN) model. To reduce the workload of manually preparing the dataset for training the CNN, a clustering-algorithm-based method is first proposed. Specifically, after splitting CT slices into image patches, the k-means clustering algorithm with two categories is performed twice, using the mean and the minimum intensity of each image patch, respectively. A cross-shaped verification, a volume intersection, a connected component analysis and a patch expansion follow to generate the final dataset. Secondly, we design a CNN architecture consisting of only one convolutional layer with six kernels, followed by one maximum pooling layer and two fully connected layers. Using the generated dataset, a variety of CNN models are trained and optimized, and their performances are evaluated by eightfold cross-validation. A separate validation experiment is further conducted using a dataset of 201 subjects (4.62 billion patches) with lung cancer or chronic obstructive pulmonary disease, scanned by CT or PET/CT. The segmentation results by our method are compared with those yielded by manual segmentation and some available methods. RESULTS: A total of 121,728 patches are generated to train and validate the CNN models. After the parameter optimization, our CNN model achieves an average F-score of 0.9917 and an area under the curve up to 0.9991 for classification of lung parenchyma and non-lung-parenchyma. The obtained model can segment the lung parenchyma accurately for 201 subjects with heterogeneous lung diseases and CT scanners. The overlap ratio between the manual segmentation and the one by our method reaches 0.96. CONCLUSIONS: The results demonstrated that the proposed clustering-algorithm-based method can generate the training dataset for CNN models. The obtained CNN model can segment lung parenchyma with very satisfactory performance and has the potential to locate and analyze lung lesions. |
format | Online Article Text |
id | pubmed-6317251 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-63172512019-01-08 Segmentation of lung parenchyma in CT images using CNN trained with the clustering algorithm generated dataset Xu, Mingjie Qi, Shouliang Yue, Yong Teng, Yueyang Xu, Lisheng Yao, Yudong Qian, Wei Biomed Eng Online Research BACKGROUND: Lung segmentation constitutes a critical procedure for any clinical-decision supporting system aimed to improve the early diagnosis and treatment of lung diseases. Abnormal lungs mainly include lung parenchyma with commonalities on CT images across subjects, diseases and CT scanners, and lung lesions presenting various appearances. Segmentation of lung parenchyma can help locate and analyze the neighboring lesions, but is not well studied in the framework of machine learning. METHODS: We propose to segment lung parenchyma using a convolutional neural network (CNN) model. To reduce the workload of manually preparing the dataset for training the CNN, a clustering-algorithm-based method is first proposed. Specifically, after splitting CT slices into image patches, the k-means clustering algorithm with two categories is performed twice, using the mean and the minimum intensity of each image patch, respectively. A cross-shaped verification, a volume intersection, a connected component analysis and a patch expansion follow to generate the final dataset. Secondly, we design a CNN architecture consisting of only one convolutional layer with six kernels, followed by one maximum pooling layer and two fully connected layers. Using the generated dataset, a variety of CNN models are trained and optimized, and their performances are evaluated by eightfold cross-validation. A separate validation experiment is further conducted using a dataset of 201 subjects (4.62 billion patches) with lung cancer or chronic obstructive pulmonary disease, scanned by CT or PET/CT. The segmentation results by our method are compared with those yielded by manual segmentation and some available methods. RESULTS: A total of 121,728 patches are generated to train and validate the CNN models. After the parameter optimization, our CNN model achieves an average F-score of 0.9917 and an area under the curve up to 0.9991 for classification of lung parenchyma and non-lung-parenchyma. The obtained model can segment the lung parenchyma accurately for 201 subjects with heterogeneous lung diseases and CT scanners. The overlap ratio between the manual segmentation and the one by our method reaches 0.96. CONCLUSIONS: The results demonstrated that the proposed clustering-algorithm-based method can generate the training dataset for CNN models. The obtained CNN model can segment lung parenchyma with very satisfactory performance and has the potential to locate and analyze lung lesions. BioMed Central 2019-01-03 /pmc/articles/PMC6317251/ /pubmed/30602393 http://dx.doi.org/10.1186/s12938-018-0619-9 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Xu, Mingjie Qi, Shouliang Yue, Yong Teng, Yueyang Xu, Lisheng Yao, Yudong Qian, Wei Segmentation of lung parenchyma in CT images using CNN trained with the clustering algorithm generated dataset |
title | Segmentation of lung parenchyma in CT images using CNN trained with the clustering algorithm generated dataset |
title_full | Segmentation of lung parenchyma in CT images using CNN trained with the clustering algorithm generated dataset |
title_fullStr | Segmentation of lung parenchyma in CT images using CNN trained with the clustering algorithm generated dataset |
title_full_unstemmed | Segmentation of lung parenchyma in CT images using CNN trained with the clustering algorithm generated dataset |
title_short | Segmentation of lung parenchyma in CT images using CNN trained with the clustering algorithm generated dataset |
title_sort | segmentation of lung parenchyma in ct images using cnn trained with the clustering algorithm generated dataset |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6317251/ https://www.ncbi.nlm.nih.gov/pubmed/30602393 http://dx.doi.org/10.1186/s12938-018-0619-9 |
work_keys_str_mv | AT xumingjie segmentationoflungparenchymainctimagesusingcnntrainedwiththeclusteringalgorithmgenerateddataset AT qishouliang segmentationoflungparenchymainctimagesusingcnntrainedwiththeclusteringalgorithmgenerateddataset AT yueyong segmentationoflungparenchymainctimagesusingcnntrainedwiththeclusteringalgorithmgenerateddataset AT tengyueyang segmentationoflungparenchymainctimagesusingcnntrainedwiththeclusteringalgorithmgenerateddataset AT xulisheng segmentationoflungparenchymainctimagesusingcnntrainedwiththeclusteringalgorithmgenerateddataset AT yaoyudong segmentationoflungparenchymainctimagesusingcnntrainedwiththeclusteringalgorithmgenerateddataset AT qianwei segmentationoflungparenchymainctimagesusingcnntrainedwiththeclusteringalgorithmgenerateddataset |
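
The dataset-generation step summarized in the description field (splitting CT slices into patches, then running two-category k-means twice, once on the mean and once on the minimum intensity of each patch) can be illustrated with a rough sketch. This is not the authors' implementation. The function names, the non-overlapping patch layout, the centroid initialization, the choice of the low-intensity cluster as the lung candidate, and the way the two clusterings are combined are all assumptions made for illustration. The cross-shaped verification, connected component analysis and patch expansion mentioned in the record are not modeled here.

```python
from statistics import mean


def split_into_patches(slice_2d, size):
    """Split a 2-D slice (list of rows) into non-overlapping size x size patches.

    Assumption: patches do not overlap; the record does not state the layout.
    Each patch is returned as a flat list of intensities.
    """
    rows, cols = len(slice_2d), len(slice_2d[0])
    patches = []
    for r in range(0, rows - size + 1, size):
        for c in range(0, cols - size + 1, size):
            patches.append(
                [slice_2d[r + i][c + j] for i in range(size) for j in range(size)]
            )
    return patches


def kmeans_two_clusters(values, max_iter=100):
    """1-D k-means with k=2. Returns labels: 1 = low-intensity cluster, 0 = high."""
    # Assumption: centroids start at the extremes of the data.
    lo, hi = float(min(values)), float(max(values))
    for _ in range(max_iter):
        labels = [1 if abs(v - lo) <= abs(v - hi) else 0 for v in values]
        low = [v for v, lab in zip(values, labels) if lab == 1]
        high = [v for v, lab in zip(values, labels) if lab == 0]
        new_lo = mean(low) if low else lo
        new_hi = mean(high) if high else hi
        if (new_lo, new_hi) == (lo, hi):  # centroids stopped moving
            break
        lo, hi = new_lo, new_hi
    return labels


def candidate_lung_patches(slice_2d, size):
    """Flag patches that fall in the low-intensity cluster for BOTH features.

    Assumption: combining the two clusterings by intersection, and treating the
    darker cluster as the lung-parenchyma candidate, are illustrative choices.
    """
    patches = split_into_patches(slice_2d, size)
    by_mean = kmeans_two_clusters([mean(p) for p in patches])
    by_min = kmeans_two_clusters([min(p) for p in patches])
    return [a == 1 and b == 1 for a, b in zip(by_mean, by_min)]


# Toy 4 x 4 "slice" with made-up intensities (roughly Hounsfield-like values).
toy_slice = [
    [-800, -820, 40, 60],
    [-790, -810, 50, 30],
    [20, 35, -805, 45],
    [40, 25, 55, 50],
]
flags = candidate_lung_patches(toy_slice, 2)
```

In this toy input the bottom-right patch contains a single dark pixel, so it falls in the low cluster by minimum intensity but not by mean intensity. Requiring agreement between both clusterings rejects it, which is one plausible reason for clustering on two features.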