Cargando…

Automated segmentation of magnetic resonance bone marrow signal: a feasibility study

BACKGROUND: Manual assessment of bone marrow signal is time-consuming and requires meticulous standardisation to secure adequate precision of findings. OBJECTIVE: We examined the feasibility of using deep learning for automated segmentation of bone marrow signal in children and adolescents. MATERIAL...

Descripción completa

Detalles Bibliográficos
Autores principales: von Brandis, Elisabeth, Jenssen, Håvard B., Avenarius, Derk F. M., Bjørnerud, Atle, Flatø, Berit, Tomterstad, Anders H., Lilleby, Vibke, Rosendahl, Karen, Sakinis, Tomas, Zadig, Pia K. K., Müller, Lil-Sofie Ording
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Berlin Heidelberg 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9107442/
https://www.ncbi.nlm.nih.gov/pubmed/35107593
http://dx.doi.org/10.1007/s00247-021-05270-x
Descripción
Sumario:BACKGROUND: Manual assessment of bone marrow signal is time-consuming and requires meticulous standardisation to secure adequate precision of findings. OBJECTIVE: We examined the feasibility of using deep learning for automated segmentation of bone marrow signal in children and adolescents. MATERIALS AND METHODS: We selected knee images from 95 whole-body MRI examinations of healthy individuals and of children with chronic non-bacterial osteomyelitis, ages 6–18 years, in a longitudinal prospective multi-centre study cohort. Bone marrow signal on T2-weighted Dixon water-only images was divided into three color-coded intensity-levels: 1 = slightly increased; 2 = mildly increased; 3 = moderately to highly increased, up to fluid-like signal. We trained a convolutional neural network on 85 examinations to perform bone marrow segmentation. Four readers manually segmented a test set of 10 examinations and calculated ground truth using simultaneous truth and performance level estimation (STAPLE). We evaluated model and rater performance through Dice similarity coefficient and in consensus. RESULTS: Consensus score of model performance showed acceptable results for all but one examination. Model performance and reader agreement had highest scores for level-1 signal (median Dice 0.68) and lowest scores for level-3 signal (median Dice 0.40), particularly in examinations where this signal was sparse. CONCLUSION: It is feasible to develop a deep-learning-based model for automated segmentation of bone marrow signal in children and adolescents. Our model performed poorest for the highest signal intensity in examinations where this signal was sparse. Further improvement requires training on larger and more balanced datasets and validation against ground truth, which should be established by radiologists from several institutions in consensus. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s00247-021-05270-x.