DOI QR코드

DOI QR Code

Ensemble UNet 3+ for Medical Image Segmentation

  • Received : 2023.02.02
  • Accepted : 2023.02.07
  • Published : 2023.02.28

Abstract

In this paper, we proposed a new UNet 3+ model for medical image segmentation. The proposed ensemble(E) UNet 3+ model consists of UNet 3+s of varying depths into one unified architecture. UNet 3+s of varying depths have same encoder, but have their own decoders. They can bridge semantic gap between encoder and decoder nodes of UNet 3+. Deep supervision was used for learning on a total of 8 nodes of the E-UNet 3+ to improve performance. The proposed E-UNet 3+ model shows better segmentation results than those of the UNet 3+. As a result of the simulation, the E-UNet 3+ model using deep supervision was the best with loss function values of 0.8904 and 0.8562 for training and validation data. For the test data, the UNet 3+ model using deep supervision was the best with a value of 0.7406. Qualitative comparison of the simulation results shows the results of the proposed model are better than those of existing UNet 3+.

Keywords

References

  1. Z. W. Zhou, M. R. Siddiquee, N. Tajbakhsh, J. Liang, "UNet++: Redesigning Skip Connections to Exploit Multiscale Features in Image Segmentation", IEEE Trans Med Imaging, 2020 June; 39(6): 1856-1867. DOI:https://doi.org/10.1109/TMI.2019.2959609 
  2. J. Long, E. Shelhamer and T. Darrell, "Fully Convolutional Networks for Semantic Segmentation," The IEEE Conference on Computer Vision and Pattern Recognition, pp. 3431-3440, 2015. DOI:https://doi.org/10.48550/arXiv.1411.4038 
  3. O. Ronneberger, P. Fischer, and T. Brox, "U-net: Convolutional networks for biomedical image segmentation," International Conference on Medical image computing and computer-assisted intervention, pp. 234-241. Springer, 2015. DOI-https://doi.org/10.48550/arXiv.1505.04597 
  4. Z.W. Zhou, M.M.R. Siddiquee, N. Tajbakhsh and J.M. Liang, "UNet++: A Nested U-Net Architecture for Medical Image Segmentation," Deep Learning in Medical Image Anylysis and Multimodal Learning for Clinical Decision Support, pp: 3-11, 2018. DOI-https://doi.org/10.48550/arXiv.1807.10165 
  5. Huimin Huang, et al., "Unet 3+: A full-scale connected unet for medical image segmentation", 2020 IEEE International Conference on Acoustics, Speech and Signal Processing(ICASSP), pp. 1055-1059. IEEE, 2020. DOI- https://doi.org/10.48550/arXiv.2004.08790 
  6. Z. Wang, E.P. Simoncelli and A.C. Bovik, "Multiscale structural similarity for image quality assessment," The Thrity-Seventh Asilomar Conference on Signals, Systems & Computers, 2003. DOI- https://doi.org/10.1109/ACSSC.2003.1292216 
  7. O.O et al., "Attention u-net: Learning where to look for the pancreas," Medical Imaging with Deep Learning, 2018. DOI- https://doi.org/10.48550/arXiv.1804.03999 
  8. T.-Y. Lin, P. Goyal, R. Girshick, K.M. He and P. Dollar. "Focal loss for dense object detection," The IEEE international conference on computer vision, pp. 2980-2988, 2017. DOI- https://doi.org/10.48550/arXiv.1708.02002