• Title/Summary/Keyword: Image-to-image Translation

Rotation and Translation Invariant Feature Extraction Using Angular Projection in Frequency Domain (주파수 영역에서 각도 투영법을 이용한 회전 및 천이 불변 특징 추출)

  • Lee, Bum-Shik; Kim, Mun-Churl
    • Journal of the HCI Society of Korea / v.1 no.2 / pp.27-33 / 2006
  • This paper presents a new approach to translation- and rotation-invariant feature extraction for image texture retrieval. For rotation invariance, we introduce an angular projection along the angular frequency in the polar coordinate system. The translation- and rotation-invariant feature vector representing a texture image is constructed from the mean and the standard deviation of the magnitude of the Fourier transform spectrum obtained by the proposed angular projection. To simplify implementation of the angular projection, the Radon transform is employed to obtain the Fourier transform spectrum of images in the polar coordinate system; angular projection is then applied to extract the feature vector. We present experimental results on the MPEG-7 data set that show robustness against image rotation and discriminatory capability across different texture images. The results show that the proposed feature vector is effective in retrieval performance for texture images with homogeneity, isotropy, and local directionality.

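
The invariance idea in the abstract above can be illustrated with a minimal sketch. It is not the authors' Radon-based method: it simply bins the FFT magnitude into rings around the zero frequency and takes the mean and standard deviation over all angles in each ring. The function name `spectrum_features` and the parameter `n_radii` are hypothetical, chosen only for this illustration.

```python
import numpy as np

def spectrum_features(image, n_radii=8):
    """Mean and std of the FFT magnitude over all angles, per frequency ring.

    Simplified stand-in for an angular projection: the magnitude spectrum
    ignores (circular) translation, and pooling over angle ignores rotation.
    """
    mag = np.abs(np.fft.fftshift(np.fft.fft2(image)))
    h, w = image.shape
    y, x = np.indices((h, w))
    # Radial frequency of every bin, measured from the shifted DC position.
    r = np.hypot(y - h // 2, x - w // 2)
    edges = np.linspace(0, min(h, w) // 2, n_radii + 1)
    feats = []
    for lo, hi in zip(edges[:-1], edges[1:]):
        ring = mag[(r >= lo) & (r < hi)]
        feats.extend([ring.mean(), ring.std()])
    return np.array(feats)

rng = np.random.default_rng(0)
img = rng.random((32, 32))
shifted = np.roll(img, shift=(5, -3), axis=(0, 1))  # circular translation
rotated = np.rot90(img)                             # exact 90-degree rotation
print(np.allclose(spectrum_features(img), spectrum_features(shifted)))
print(np.allclose(spectrum_features(img), spectrum_features(rotated)))
```

The sketch only checks circular shifts and a 90-degree rotation, where the discrete spectrum is permuted exactly; arbitrary angles would need interpolation and hold only approximately.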
Eyeglass Remover Network based on a Synthetic Image Dataset

  • Kang, Shinjin; Hahn, Teasung
    • KSII Transactions on Internet and Information Systems (TIIS) / v.15 no.4 / pp.1486-1501 / 2021
  • The removal of accessories from the face is one of the essential pre-processing stages in face recognition. Despite its importance, a robust solution has not yet been provided. This paper proposes a network and a dataset construction methodology for effectively removing only the glasses from facial images. To obtain an image without glasses from an image with glasses by supervised learning, a network that performs the conversion and a set of paired training data are required. To this end, we created a large number of synthetic images of glasses being worn using facial attribute transformation networks. We adopted the conditional GAN (cGAN) framework for training. The trained network converts an in-the-wild face image with glasses into an image without glasses and operates stably even when the faces are of diverse races and ages and wear different styles of glasses.

Robot Posture Estimation Using Circular Image of Inner-Pipe (원형관로 영상을 이용한 관로주행 로봇의 자세 추정)

  • Yoon, Ji-Sup; Kang, E-Sok
    • The Transactions of the Korean Institute of Electrical Engineers D / v.51 no.6 / pp.258-266 / 2002
  • This paper proposes an image processing algorithm that estimates the pose of an inner-pipe crawling robot. Such a robot is usually equipped with a lighting device and a camera on its head for monitoring and inspecting defects on the pipe wall and/or for maintenance operations. The proposed method uses these devices without introducing extra sensors and is based on the fact that the position and intensity of the light reflected from the inner wall of the pipe vary with the posture of the robot and the camera. The algorithm is divided into two parts: estimating the translation and rotation angle of the camera, followed by the actual pose estimation of the robot. Because the vanishing point of the reflected light moves in the direction opposite to the camera rotation, the camera rotation angle can be estimated. Because the brightest parts of the reflected light move in the same direction as the camera translation, the camera position can be obtained. To investigate its performance, the algorithm is applied to a sewage maintenance robot.

Assessment of Gradient-based Digital Speckle Correlation Measurement Errors

  • Jian, Zhao; Dong, Zhao; Zhe, Zhang
    • Journal of the Optical Society of Korea / v.16 no.4 / pp.372-380 / 2012
  • The optical method Digital Speckle Correlation Measurement (DSCM) has been extensively applied due to its capability to measure the entire displacement field over a body surface. A formula for the displacement measurement errors of the gradient-based DSCM method was derived. The errors were found to relate explicitly to the image grayscale errors, consisting of sub-pixel interpolation algorithm errors, image noise, and subset deformation mismatch at each point of the subset. A power-law dependence of the standard deviation of displacement measurement errors on the subset size was established for the case where the subset deformation was rigid body translation and random image noise was dominant, and it was confirmed by both numerical and experimental results. The basic assumption of a gradient-based algorithm is rigid body translation of the interrogated subsets; however, this contradicts real circumstances where strains exist. Numerical and experimental results also indicated that subset shape function mismatch was dominant when the order of the assumed subset shape function was lower than that of the actual subset deformation field, and the power-law dependence clearly broke down. The power-law relationship further leads to a simple criterion for choosing a suitable subset size, image quality, sub-pixel algorithm, and subset shape function for DSCM.
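
A power-law dependence of the error standard deviation on subset size, as described in the abstract above, is a straight line in log-log coordinates, so its exponent can be estimated with a linear fit. The sketch below assumes a model of the form sigma = a * N^b; the function name `fit_power_law` and the sample values are hypothetical and synthetic, not data from the paper.

```python
import numpy as np

def fit_power_law(subset_sizes, error_std):
    """Fit error_std = a * subset_sizes**b by least squares in log-log space."""
    slope, intercept = np.polyfit(np.log(subset_sizes), np.log(error_std), 1)
    return float(np.exp(intercept)), float(slope)

# Synthetic check only: data generated from a known law (a = 0.5, b = -1).
sizes = np.array([11.0, 21.0, 31.0, 41.0, 51.0])
sigma = 0.5 * sizes ** -1.0
a, b = fit_power_law(sizes, sigma)
print(a, b)
```

On measured data, points that fall away from the fitted line at some subset sizes would be one sign that the power law has broken down, for example through shape function mismatch.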

Semantic Object Segmentation Using Conditional Generative Adversarial Network with Residual Connections (잔차 연결의 조건부 생성적 적대 신경망을 사용한 시맨틱 객체 분할)

  • Ibrahem, Hatem; Salem, Ahmed; Yagoub, Bilel; Kang, Hyun Su; Suh, Jae-Won
    • Journal of the Korea Institute of Information and Communication Engineering / v.26 no.12 / pp.1919-1925 / 2022
  • In this paper, we propose an image-to-image translation approach to semantic segmentation based on the conditional generative adversarial network. Semantic segmentation is the task of grouping together the parts of an image that belong to the same object class. Unlike the traditional pixel-wise classification approach, the proposed method maps an input RGB image to its corresponding semantic segmentation mask using pixel regression. The method is based on the Pix2Pix image synthesis method. We employ convolutional neural network architectures with residual connections for both the generator and the discriminator, as the residual connections speed up training and yield more accurate results. The proposed method was trained and tested on the NYU-depthV2 dataset and achieved an mIoU of 49.5%. We also compare the proposed approach with current semantic segmentation methods and show that it outperforms most of them.
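
The mIoU figure quoted in the abstract above is the mean, over classes, of intersection over union between predicted and ground-truth masks. A minimal sketch of that metric follows; the function name `mean_iou` is hypothetical, and skipping classes absent from both masks is one common convention, assumed here rather than taken from the paper.

```python
import numpy as np

def mean_iou(pred, target, num_classes):
    """Mean intersection-over-union across classes for integer label masks."""
    ious = []
    for c in range(num_classes):
        p, t = pred == c, target == c
        union = np.logical_or(p, t).sum()
        if union == 0:
            continue  # class absent from both masks: skip it (assumed convention)
        ious.append(np.logical_and(p, t).sum() / union)
    return float(np.mean(ious))

pred = np.array([[0, 0], [1, 1]])
target = np.array([[0, 1], [1, 1]])
print(mean_iou(pred, target, num_classes=3))  # class 0: 1/2, class 1: 2/3
```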

RST Invariant Digital Watermarking Based on Image Representation by Wedges and Rings

  • Kim, Ki-Jung
    • International Journal of Contents / v.5 no.2 / pp.26-31 / 2009
  • This paper describes a new image watermarking scheme invariant to rotation, scaling and translation (RST) attacks. To obtain the invariance properties, we propose representing the watermark image by wedges and rings, which converts its rotation to a shift, and then exploiting the shift invariance property of the Discrete Fourier Transform (DFT). In contrast to conventional schemes based on the Fourier-Mellin transform (FMT), we do not use a log-polar mapping (LPM). As a result, our scheme preserves the high quality of the original image, since the image does not undergo LPM. To withstand JPEG compression, noise addition and low-pass (LP) filtering attacks, a low-frequency watermark is embedded into the middle frequencies of the original image. Experiments with various attacks show the robustness of the proposed scheme.
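
The step from rotation to shift invariance described in the abstract above can be sketched in one dimension. Assume, hypothetically, that an image has been summarized as a vector of per-wedge energies; rotating the image by a whole number of wedges then circularly shifts that vector, and the magnitude of its DFT does not change. The name `shift_invariant_signature` and the sample vector are illustrative only, not the paper's embedding procedure.

```python
import numpy as np

def shift_invariant_signature(wedge_energies):
    """DFT magnitude of a wedge-energy vector; unchanged by circular shifts."""
    return np.abs(np.fft.fft(np.asarray(wedge_energies, dtype=float)))

wedges = np.array([4.0, 1.0, 0.5, 2.0, 3.0, 0.0, 1.5, 2.5])  # hypothetical values
rotated = np.roll(wedges, 3)  # rotation by three wedges becomes a shift
print(np.allclose(shift_invariant_signature(wedges),
                  shift_invariant_signature(rotated)))
```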

Image Stabilization Scheme for Arbitrary Disturbance (임의의 외란에 대한 영상 안정화)

  • Kwak, Hwy-Kuen
    • Journal of the Korea Academia-Industrial cooperation Society / v.15 no.9 / pp.5750-5757 / 2014
  • This paper proposes an image stabilization method for arbitrary disturbances, such as rotation, translation and zoom movement, using the SIFT (Scale Invariant Feature Transform). In addition, when moving objects appear in the scene, image stabilization is carried out using an image division and merge technique. The experimental results show that the proposed image stabilization scheme outperforms previous methods.

Brain MR Multimodal Medical Image Registration Based on Image Segmentation and Symmetric Self-similarity

  • Yang, Zhenzhen; Kuang, Nan; Yang, Yongpeng; Kang, Bin
    • KSII Transactions on Internet and Information Systems (TIIS) / v.14 no.3 / pp.1167-1187 / 2020
  • With the development of medical imaging technology, image registration has been widely used in disease diagnosis. Registration between brain magnetic resonance (MR) images of different modalities is particularly important for the diagnosis of brain diseases. However, previous registration methods do not take advantage of prior knowledge of bilateral brain symmetry. Moreover, the difference in gray-scale information between modalities increases the difficulty of registration. In this paper, a multimodal medical image registration method based on image segmentation and symmetric self-similarity is proposed. The method uses modality-independent self-similarity information and modality-consistency information to register images. More specifically, we propose two novel symmetric self-similarity constraint operators that constrain the segmented medical images and convert each modality into a unified modality for multimodal image registration. The experimental results show that the proposed method effectively reduces the error of brain MR multimodal medical image registration under rotation and translation transformations (averages of 0.43 mm and 0.60 mm, respectively), and that its accuracy is better than that of state-of-the-art image registration methods.

Handwritten Numeral Recognition Based on Modular Neural Networks Utilizing Rotated and Translated Images (회전 및 이동 영상을 이용하는 모듈 구조 신경망 기반 필기체 숫자 인식)

  • Im, Gil-Taek; Nam, Yun-Seok; Jin, Seong-Il
    • The Transactions of the Korea Information Processing Society / v.7 no.6 / pp.1834-1843 / 2000
  • In this paper, we propose a modular neural network based classification method for handwritten numerals that utilizes rotated and translated versions of an input image. The whole numeral pattern space is divided into smaller spaces that overlap each other and form multiple clusters. On these clusters, multiple multilayer perceptron (MLP) neural networks, each specialized in one cluster, are constructed. Thus, each MLP acts as an expert network for the corresponding cluster. An MLP is also used as a gating network that mediates among the multiple MLPs. In the learning phase, an input numeral image is dithered by two geometric operations, translation and rotation, so that new numeral images similar to the original are generated. In the recognition phase, we utilize not only the input numeral image but also newly generated images obtained by rotating and translating the original image. The multiple output values for those generated images are then combined by various combination methods to make the class decision. The experimental results confirm the validity of the proposed method.

Digital Image Stabilization Using Simple Estimation of Rotational and Translational Motion (회전 및 병진운동 추정을 통한 디지털 영상안정화)

  • Seok, Ho-Dong; Kang, Kil-Soon; Lyou, Joon
    • Proceedings of the KIEE Conference / 2004.11c / pp.46-48 / 2004
  • This paper presents a simple method of rotational and translational motion estimation for digital image stabilization. The scheme first computes the rotation center by a least-squares fit to selected local velocity vectors, and the rotation angle is found from a special subset of motion vectors. The translational motion is then estimated from the relation among the movement of the rotation center, the rotation angle, and the translation movement. To show the effectiveness of our approach, we evaluate it on synthetic images, where it yields better performance.

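
One plausible formulation of the least-squares rotation center step in the abstract above is sketched below; it is an assumption, not the authors' exact procedure. For a small pure rotation about a center c, each local velocity v_i at point p_i is perpendicular to p_i - c, which gives the linear system v_i · c = v_i · p_i. The function names `rotation_center` and `rotation_angle` are hypothetical, and translation estimation is left out.

```python
import numpy as np

def rotation_center(points, velocities):
    """Least-squares center c satisfying v_i . (p_i - c) = 0 for all i."""
    b = np.einsum("ij,ij->i", velocities, points)  # row-wise dot products
    center, *_ = np.linalg.lstsq(velocities, b, rcond=None)
    return center

def rotation_angle(points, velocities, center):
    """Least-squares small angle w in the model v_i = w * perp(p_i - c)."""
    d = points - center
    perp = np.stack([-d[:, 1], d[:, 0]], axis=1)  # (x, y) turned by 90 degrees
    return float(np.sum(velocities * perp) / np.sum(perp * perp))

# Synthetic motion field: rotation of 0.05 rad about (3, -2), columns are (x, y).
rng = np.random.default_rng(1)
pts = rng.uniform(-10, 10, size=(20, 2))
true_c, true_w = np.array([3.0, -2.0]), 0.05
vel = true_w * np.stack([-(pts[:, 1] - true_c[1]), pts[:, 0] - true_c[0]], axis=1)
c = rotation_center(pts, vel)
print(c, rotation_angle(pts, vel, c))
```

With noisy motion vectors the same two solves return the best-fit center and angle in the least-squares sense, which is why selecting reliable vectors first matters.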