• Title/Summary/Keyword: ESRGAN

Search Result 5

Object Segmentation Using ESRGAN and Semantic Soft Segmentation (ESRGAN과 Semantic Soft Segmentation을 이용한 객체 분할)

  • Dongsik Yoon;Noyoon Kwak
    • Journal of Internet of Things and Convergence / v.9 no.1 / pp.97-104 / 2023
  • This paper addresses object segmentation using ESRGAN (Enhanced Super Resolution GAN) and SSS (Semantic Soft Segmentation). The object segmentation method based on Mask R-CNN and SSS that the authors proposed previously generally performs well, but its performance degrades when the objects are relatively small. To address this problem, the proposed method performs super-resolution with ESRGAN before running SSS whenever the size of an object detected by Mask R-CNN is below a certain threshold. The results confirmed that the proposed method segments small objects more effectively than the previous method.
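
The size-gated pipeline described in this abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: `nearest_upscale` and `threshold_matte` are hypothetical stand-ins for ESRGAN and SSS, the boxes are assumed to come from a detector such as Mask R-CNN, and both the `min_side` value of 32 and the "shorter side of the box" size criterion are assumptions, since the abstract does not state the threshold or how size is measured.

```python
from typing import Callable, List, Sequence, Tuple

Image = List[List[float]]        # grayscale image as rows of pixel values
Box = Tuple[int, int, int, int]  # (x0, y0, x1, y1), upper bounds exclusive


def crop(image: Image, box: Box) -> Image:
    """Cut the region covered by a detection box out of the image."""
    x0, y0, x1, y1 = box
    return [row[x0:x1] for row in image[y0:y1]]


def nearest_upscale(image: Image, scale: int = 4) -> Image:
    """Hypothetical stand-in for ESRGAN: nearest-neighbour upscaling, not a learned model."""
    return [[px for px in row for _ in range(scale)]
            for row in image for _ in range(scale)]


def threshold_matte(image: Image) -> Image:
    """Hypothetical stand-in for SSS: a hard threshold at the mean intensity."""
    flat = [px for row in image for px in row]
    mean = sum(flat) / len(flat)
    return [[1.0 if px > mean else 0.0 for px in row] for row in image]


def segment_objects(
    image: Image,
    boxes: Sequence[Box],
    min_side: int = 32,  # assumed threshold; the abstract gives no value
    upscale: Callable[[Image], Image] = nearest_upscale,
    segment: Callable[[Image], Image] = threshold_matte,
) -> List[Tuple[Box, bool, Image]]:
    """Segment each detected object, super-resolving small ones first."""
    results = []
    for box in boxes:
        patch = crop(image, box)
        height, width = len(patch), len(patch[0])
        # Assumed size criterion: the shorter side of the detection box.
        was_upscaled = min(height, width) < min_side
        if was_upscaled:
            patch = upscale(patch)
        results.append((box, was_upscaled, segment(patch)))
    return results
```

Only the gating logic is meant to carry over. In practice `upscale` and `segment` would wrap real models, and the matte of an upscaled patch would probably need to be resized back to the original box, a step the abstract does not describe.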

A Research on Re-examining Discriminator Design Space for Performance Improvement of ESRGAN (ESRGAN의 성능 향상을 위한 판별자 설계 공간 재검토에 관한 연구)

  • Sung-Wook Park;Jun-Yeong Kim;Jun Park;Se-Hoon Jung;Chun-Bo Sim
    • Proceedings of the Korea Information Processing Society Conference / 2023.05a / pp.513-514 / 2023
  • Super-resolution is a technique that synthesizes a high-resolution image from a low-resolution one. After deep learning was applied to this technique, the SRCNN (Super Resolution Convolutional Neural Network) model was published in 2014. Since then, models that outperform SRCNN have been published, including SRCAE (Super Resolution Convolutional Autoencoders) and SRGAN (Super Resolution Generative Adversarial Networks), which is based on GANs (Generative Adversarial Networks). ESRGAN (Enhanced Super Resolution Generative Adversarial Networks) improved on SRGAN, but its performance is still not perfect. This paper therefore modifies the discriminator architecture to improve the performance of ESRGAN. Based on the experiments, the proposed model is expected to outperform ESRGAN.

Performance Improvement of Object Segmentation Using ESRGAN and Semantic Soft Segmentation (ESRGAN과 Semantic Soft Segmentation을 이용한 객체 분할의 성능 개선)

  • Yoon, DongSik;Kwak, Noyoon
    • Proceedings of the Korea Information Processing Society Conference / 2020.05a / pp.468-471 / 2020
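  • This paper concerns improving the performance of object segmentation using ESRGAN (Enhanced Super Resolution GAN) and Semantic Soft Segmentation. The object segmentation method using Mask R-CNN and Semantic Soft Segmentation that the authors proposed previously performs well overall, but its segmentation performance degrades when the objects are relatively small. To solve this problem, this paper aims to improve the segmentation of small objects by performing super-resolution with ESRGAN and then performing Semantic Soft Segmentation when the size of an object detected by Mask R-CNN is at or below a certain threshold. The proposed method was confirmed to improve the segmentation of small objects more effectively than the existing method.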

Development of compound eye image quality improvement based on ESRGAN (ESRGAN 기반의 복안영상 품질 향상 알고리즘 개발)

  • Taeyoon Lim;Yongjin Jo;Seokhaeng Heo;Jaekwan Ryu
    • Journal of the Korea Computer Graphics Society / v.30 no.2 / pp.11-19 / 2024
  • Demand is increasing for small biomimetic robots that can carry out reconnaissance missions in underground spaces and narrow passages without being exposed to the enemy, in order to increase the fighting power and survivability of soldiers in wartime. A small compound eye image sensor for environmental recognition offers advantages such as small size, low aberration, a wide angle of view, depth estimation, and HDR, which can be used in various ways in the field of vision. However, because the lenses are small, the resolution is low, and the fused image obtained from actual compound eye images is also low in resolution. This paper proposes a compound eye image quality enhancement algorithm based on image enhancement and ESRGAN to overcome this low resolution. Applying the proposed algorithm to fused compound eye images can improve image resolution and quality, so it is expected to improve performance in various studies that use compound eye cameras.

Comparative analysis of the deep-learning-based super-resolution methods for generating high-resolution texture maps (고해상도 텍스처 맵 생성을 위한 딥러닝 기반 초해상도 기법들의 비교 분석 연구)

  • Hyeju Kim;Jah-Ho Nah
    • Journal of the Korea Computer Graphics Society / v.29 no.5 / pp.31-40 / 2023
  • As display resolution increases, many apps also tend to include high-resolution texture maps. Recent advancements in deep-learning-based image super-resolution techniques make it possible to automate high-resolution texture generation. However, there is still a lack of comprehensive analysis of the application of these techniques to texture maps. In this paper, we selected three recent super-resolution techniques, namely BSRGAN, Real-ESRGAN, and SwinIR (classical and real-world image SR), and applied them to upscale texture maps. We then conducted a quantitative and qualitative analysis of the experimental results. The findings revealed various artifacts after upscaling, which indicates that there are still limitations in directly applying super-resolution techniques to texture-map upscaling.