DOI QR코드

DOI QR Code

Multiple Mixed Modes: Single-Channel Blind Image Separation

  • Tiantian Yin (Electronics and Communication Engineering, Taiyuan University of Science and Technology) ;
  • Yina Guo (Electronics and Communication Engineering, Taiyuan University of Science and Technology) ;
  • Ningning Zhang (Upgrading Office of Modern College of Humanities and Sciences of Shanxi Normal University)
  • 투고 : 2022.06.16
  • 심사 : 2023.01.21
  • 발행 : 2023.12.31

초록

As one of the pivotal techniques of image restoration, single-channel blind source separation (SCBSS) is capable of converting a visual-only image into multi-source images. However, image degradation often results from multiple mixing methods. Therefore, this paper introduces an innovative SCBSS algorithm to effectively separate source images from a composite image in various mixed modes. The cornerstone of this approach is a novel triple generative adversarial network (TriGAN), designed based on dual learning principles. The TriGAN redefines the discriminator's function to optimize the separation process. Extensive experiments have demonstrated the algorithm's capability to distinctly separate source images from a composite image in diverse mixed modes and to facilitate effective image restoration. The effectiveness of the proposed method is quantitatively supported by achieving an average peak signal-to-noise ratio exceeding 30 dB, and the average structural similarity index surpassing 0.95 across multiple datasets.

키워드

과제정보

This paper is funded by National Natural Science Foundation of China (Grant No. 61301250), China Scholarship Council (Grant No. [2020]1417), Key Research and Development Project of Shanxi Province (Grant No. 201803D421035), Natural Science Foundation for Young Scientists of Shanxi Province (Grant No. 201901D211313), Shanxi Scholarship Council of China (Grant No. HGKY2019080 and 2020-127), Scientific and Technological Innovation Programs of Higher Education Institutions in Shanxi (STIP) (Grant No. 2022L641), and Shanxi Province Postgraduate Excellent Innovation Project Plan (Grant No. 2021Y679).

참고문헌

  1. W. Fu, X. Zhou, and B. Nong, "The research of SCBSS technology: survey and prospect," Journal of Beijing University of Posts and Telecommunications, vol. 40, no. 5, pp. 1-11, 2017.
  2. R. Hephzibah, H. C. Anandharaj, G. Kowsalya, R. Jayanthi, and D. A. Chandy, "Review on deep learning methodologies in medical image restoration and segmentation," Current Medical Imaging, vol. 19, no. 8, pp. 844-854, 2023. https://doi.org/10.2174/1573405618666220407112825
  3. G. Liu, W. Tian, Y. Luo, J. Zou, and S. Tang, "A windowed-total-variation regularization constraint model for blind image restoration," Journal of Information Processing Systems, vol. 18, no. 1, pp. 48-58, 2022. https://doi.org/10.3745/JIPS.04.0233
  4. J. Han, Y. Shi, Y. Long, and J. Liang, "Exploring single channel speech separation for short-time textdependent speaker verification," International Journal of Speech Technology, vol. 25, pp. 261-268, 2022. https://doi.org/10.1007/s10772-022-09959-8
  5. W. H. Lai and S. L. Wang, "RPCA-DRNN technique for monaural singing voice separation," EURASIP Journal on Audio, Speech, and Music Processing, vol. 2022, article no. 4, 2022. https://doi.org/10.1186/s13636-022-00236-9
  6. B. Wu, T. Cheng, T. L. Yip, and Y. Wang, "Fuzzy logic based dynamic decision-making system for intelligent navigation strategy within inland traffic separation schemes," Ocean Engineering, vol. 197, article no. 106909, 2020. https://doi.org/10.1016/j.oceaneng.2019.106909
  7. D. Kitamura, H. Saruwatari, K. Shikano, K. Kondo, and Y. Takahashi, "Music signal separation by supervised nonnegative matrix factorization with basis deformation," in Proceedings of 2013 18th International Conference on Digital Signal Processing (DSP), Fira, Greece, 2013, pp. 1-6. https://doi.org/10.1109/ICDSP.2013.6622812
  8. P. Dey, U. Satija, and B. Ramkumar, "Single channel blind source separation based on variational mode decomposition and PCA," in Proceedings of 2015 Annual IEEE India Conference (INDICON), New Delhi, India, 2015, pp. 1-5. https://doi.org/10.1109/INDICON.2015.7443723
  9. Z. Zhang, "Image restoration based on principal component analysis," Journal of Changchun Normal University (Natural Science Edition), vol. 38, no. 6, pp. 21-23, 29, 2019. https://doi.org/10.3969/j.issn.1008-178X.2019.06.005
  10. Z. C. Fan, Y. L. Lai, and J. S. R. Jang, "SVSGAN: singing voice separation via generative adversarial network," in Proceedings of 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Calgary, Canada, 2018, pp. 726-730. https://doi.org/10.1109/ICASSP.2018.8462091
  11. D. Stoller, S. Ewert, and S. Dixon, "Adversarial semi-supervised audio source separation applied to singing voice extraction," in Proceedings of 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Calgary, Canada, 2018, pp. 2391-2395. https://doi.org/10.1109/ICASSP.2018.8461722
  12. X. Lin and Y. Gao, "An improved resampling particle filter blind separation algorithm," Journal of Chongqing University of Posts and Telecommunications (Natural Science Edition), vol. 31, no. 4, pp. 502-508, 2019. https://doi.org/10.3979/j.issn.1673-825X.2019.04.010
  13. K. Zhang, W. Zuo, Y. Chen, D. Meng, and L. Zhang, "Beyond a gaussian denoiser: residual learning of deep CNN for image denoising," IEEE Transactions on Image Processing, vol. 26, no. 7, pp. 3142-3155, 2017. https://doi.org/10.1109/TIP.2017.2662206
  14. E. M. Grais, G. Roma, A. J. Simpson, and M. D. Plumbley, "Two-stage single-channel audio source separation using deep neural networks," IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 25, no. 9, pp. 1773-1783, 2017. https://doi.org/10.1109/TASLP.2017.2716443
  15. C. Zheng, X. Zhang, T. Cao, J. Yang, M. Sun, and Y. Xing, "Blind enhancement algorithm for throat microphone speech based on LSTM recurrent neural networks," Journal of Data Acquisition and Processing, vol. 34, no. 4, pp. 615-624, 2019. https://doi.org/10.16337/j.1004-9037.2019.04.005
  16. T. Wang, C. Qiu, J. Yu, and J. Xu, "Natural image classification based on improved SAE network," Information Technology, vol. 2016, no. 8, pp. 1-4,8, 2016. https://doi.org/10.13274/j.cnki.hdzj.2016.08.001
  17. M. Mirza and S. Osindero, "Conditional generative adversarial nets," 2014 [Online]. Available: https://arxiv.org/abs/1411.1784.
  18. Y. C. Subakan and P. Smaragdis, "Generative adversarial source separation," in Proceedings of 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Calgary, Canada, 2018, pp. 26-30. https://doi.org/10.1109/ICASSP.2018.8461671
  19. Q. Kong, Y. Xu, W. Wang, P. J. Jackson, and M. D. Plumbley, "Single-channel signal separation and deconvolution with generative adversarial networks," 2019 [Online]. Available: https://arxiv.org/abs/1906.07552.
  20. T. Liu, T. Yin, Z. Gong, and Y. Guo, "Research on single-channel blind deconvolution algorithm for multisource signals," Journal of Electronics & Information Technology, vol. 44, no. 1, pp. 230-236, 2022. https://doi.org/10.11999/JEIT200933
  21. T. Yin, T. Liu, and Y. Guo, "Research on single-channel blind deconvolution algorithm for ancient Chinese character restoration," Small Microcomputer System, vol. 42, no. 2, pp. 414-417, 2021. https://doi.org/10.3969/j.issn.1000-1220.2021.02.033
  22. Z. Yi, H. Zhang, P. Tan, and M. Gong, "DualGAN: unsupervised dual learning for image-to-image translation," in Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 2017, pp. 2849-2857. https://doi.org/10.1109/ICCV.2017.310