• Title/Summary/Keyword: Image-to-image Translation

Search Result 306, Processing Time 0.034 seconds

Vision-based Target Tracking for UAV and Relative Depth Estimation using Optical Flow (무인 항공기의 영상기반 목표물 추적과 광류를 이용한 상대깊이 추정)

  • Jo, Seon-Yeong;Kim, Jong-Hun;Kim, Jung-Ho;Lee, Dae-Woo;Cho, Kyeum-Rae
    • Journal of the Korean Society for Aeronautical & Space Sciences
    • /
    • v.37 no.3
    • /
    • pp.267-274
    • /
    • 2009
  • Recently, UAVs (Unmanned Aerial Vehicles) are expected much as the Unmanned Systems for various missions. These missions are often based on the Vision System. Especially, missions such as surveillance and pursuit have a process which is carried on through the transmitted vision data from the UAV. In case of small UAVs, monocular vision is often used to consider weights and expenses. Research of missions performance using the monocular vision is continued but, actually, ground and target model have difference in distance from the UAV. So, 3D distance measurement is still incorrect. In this study, Mean-Shift Algorithm, Optical Flow and Subspace Method are posed to estimate the relative depth. Mean-Shift Algorithm is used for target tracking and determining Region of Interest (ROI). Optical Flow includes image motion information using pixel intensity. After that, Subspace Method computes the translation and rotation of image and estimates the relative depth. Finally, we present the results of this study using images obtained from the UAV experiments.

Using Contour Matching for Omnidirectional Camera Calibration (투영곡선의 자동정합을 이용한 전방향 카메라 보정)

  • Hwang, Yong-Ho;Hong, Hyun-Ki
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.45 no.6
    • /
    • pp.125-132
    • /
    • 2008
  • Omnidirectional camera system with a wide view angle is widely used in surveillance and robotics areas. In general, most of previous studies on estimating a projection model and the extrinsic parameters from the omnidirectional images assume corresponding points previously established among views. This paper presents a novel omnidirectional camera calibration based on automatic contour matching. In the first place, we estimate the initial parameters including translation and rotations by using the epipolar constraint from the matched feature points. After choosing the interested points adjacent to more than two contours, we establish a precise correspondence among the connected contours by using the initial parameters and the active matching windows. The extrinsic parameters of the omnidirectional camera are estimated minimizing the angular errors of the epipolar plane of endpoints and the inverse projected 3D vectors. Experimental results on synthetic and real images demonstrate that the proposed algorithm obtains more precise camera parameters than the previous method.

Fingerprint Recognition using Linking Information of Minutiae (특징점의 연결정보를 이용한 지문인식)

  • Cha, Heong-Hee;Jang, Seok-Woo;Kim, Gye-Young;Choi, Hyung-Il
    • The KIPS Transactions:PartB
    • /
    • v.10B no.7
    • /
    • pp.815-822
    • /
    • 2003
  • Fingerprint image enhancement and minutiae matching are two key steps in an automatic fingerprint identification system. In this paper, we propose a fingerprint recognition technique by using minutiae linking information. Recognition process have three steps ; preprocessing, minutiae extraction, matching step based on minutiae pairing. After extracting minutiae of a fingerprint from its thinned image for accuracy, we introduce matching process using minutiae linking information. Introduction of linking information into the minutiae matching process is a simple but accurate way, which solves the problem of reference minutiae pair selection with low cost in comparison stage of two fingerprints. This algorithm is invariable to translation and rotation of fingerprint. The matching algorithm was tested on 500 images from the semiconductor chip style scanner, experimental result revealed the false acceptance rate is decreased and genuine acceptance rate is increased than existing method.

Cost Effective Mobility Anchor Point Selection Scheme for F-HMIPv6 Networks (F-HMIPv6 환경에서의 비용 효율적인 MAP 선택 기법)

  • Roh Myoung-Hwa;Jeong Choong-Kyo
    • KSCI Review
    • /
    • v.14 no.1
    • /
    • pp.265-271
    • /
    • 2006
  • In this paper, we propose a new automatic fingerprint identification system that identifies individuals in large databases. The algorithm consists of three steps: preprocessing, classification, and matching, in the classification, we present a new classification technique based on the statistical approach for directional image distribution. In matching, we also describe improved minutiae candidate pair extraction algorithm that is faster and more accurate than existing algorithm. In matching stage, we extract fingerprint minutiaes from its thinned image for accuracy, and introduce matching process using minutiae linking information. Introduction of linking information into the minutiae matching process is a simple but accurate way, which solves the problem of reference minutiae pair selection in comparison stage of two fingerprints quickly. This algorithm is invariant to translation and rotation of fingerprint. The proposed system was tested on 1000 fingerprint images from the semiconductor chip style scanner. Experimental results reveal false acceptance rate is decreased and genuine acceptance rate is increased than existing method.

  • PDF

Considerations for Applying Korean Natural Language Processing Technology in Records Management (기록관리 분야에서 한국어 자연어 처리 기술을 적용하기 위한 고려사항)

  • Haklae, Kim
    • Journal of Korean Society of Archives and Records Management
    • /
    • v.22 no.4
    • /
    • pp.129-149
    • /
    • 2022
  • Records have temporal characteristics, including the past and present; linguistic characteristics not limited to a specific language; and various types categorized in a complex way. Processing records such as text, video, and audio in the life cycle of records' creation, preservation, and utilization entails exhaustive effort and cost. Primary natural language processing (NLP) technologies, such as machine translation, document summarization, named-entity recognition, and image recognition, can be widely applied to electronic records and analog digitization. In particular, Korean deep learning-based NLP technologies effectively recognize various record types and generate record management metadata. This paper provides an overview of Korean NLP technologies and discusses considerations for applying NLP technology in records management. The process of using NLP technologies, such as machine translation and optical character recognition for digital conversion of records, is introduced as an example implemented in the Python environment. In contrast, a plan to improve environmental factors and record digitization guidelines for applying NLP technology in the records management field is proposed for utilizing NLP technology.

Computed Tomographic Simulation of Craniospinal Irradiation (전산화 단층 촬영 장치를 이용한 뇌척수 조사의 치료 계획)

  • Lee CI;Kim HN;Oh TY;Hwang DS;Park NS;Kye CS;Kim YS
    • The Journal of Korean Society for Radiation Therapy
    • /
    • v.11 no.1
    • /
    • pp.53-59
    • /
    • 1999
  • The aim of this study is to improve the accuracy of field placement and junction between adjacent fields and block shielding through the use of a computed tomography(CT) simulator and virtual simulation. The information was acquired by assessment of Alderson Rando phantom image using CT simulator (I.Q. Xtra - Picker), determination of each field by virtual fluoroscopy of voxel IQ workstation AcQsim and colored critical structures that were obtained by contouring in virtual simulation. And also using a coronal, sagittal and axial view can determine the field and adjacent field gap correctly without calculation during the procedure. With the treatment planning by using the Helax TMS 4.0, the dose in the junction among the adjacent fields and the spinal cord and cribriform plate of the critical structure was evaluated by the dose volume histogram. The pilot image of coronal and sagittal view took about 2minutes and 26minutes to get 100 images. Image translation to the virtual simulation workstation took about 6minutes. Contouring a critical structure such as cribriform plate, spinal cord using a virtual fluoroscopy were eligible to determine a correct field and shielding. The process took about 20 minutes. As the result of the Helax planning, the dose distribution in adjacent field junction was ideal, and the dose level shows almost 100 percentage in the dose volume histogram of the spinal cord and cribriform plate CT simulation can get a correct therapy area due to enhancement of critical structures such as spinal cord and cribriform plate. In addition, using a Spiral CT scanner can be saved a lot of time to plan a simulation therefore this function can reduce difficulties to keep the patient position without any movements to the patient, physician and radiotherapy technician.

  • PDF

An Optical Flow Based Time-to-Collision Predictor

  • Yamaguchi, T.;Kashiwagi, H.;Harada, H.
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 1998.10a
    • /
    • pp.232-237
    • /
    • 1998
  • This paper describes a new method for estimating time-to-collision which exhibits high tolerance to noise contained in camera images. Time to collision (TTC) is one of the most important parameters available from a camera attached to a mobile machine. TTC indirectly stands far the translation speed of the camera and is usually calculated either from successive images or optical flow by using intimate relationship between TTC and flow divergence. In most cases, however, it is not easy to get accurate optical flow, which makes it difficult to calculate TTC. In this study it is proved that if the target has a smooth surface, the average of divergence over any point-symmetric region on the image is equal to the divergence of the center of the region. It means that required divergence can be calculated by integrating optical flow vectors over a symmetric region. It is expected that in the process of the integration, accidental noise is canceled if they are independent of optical flow and the motion of the camera. Experimental results show that TTC can be estimated regardless of the surface condition. It is also shown that influence of noise is eliminated as the area of integration increases.

  • PDF

Validation Data Augmentation for Improving the Grading Accuracy of Diabetic Macular Edema using Deep Learning (딥러닝을 이용한 당뇨성황반부종 등급 분류의 정확도 개선을 위한 검증 데이터 증강 기법)

  • Lee, Tae Soo
    • Journal of Biomedical Engineering Research
    • /
    • v.40 no.2
    • /
    • pp.48-54
    • /
    • 2019
  • This paper proposed a method of validation data augmentation for improving the grading accuracy of diabetic macular edema (DME) using deep learning. The data augmentation technique is basically applied in order to secure diversity of data by transforming one image to several images through random translation, rotation, scaling and reflection in preparation of input data of the deep neural network (DNN). In this paper, we apply this technique in the validation process of the trained DNN, and improve the grading accuracy by combining the classification results of the augmented images. To verify the effectiveness, 1,200 retinal images of Messidor dataset was divided into training and validation data at the ratio 7:3. By applying random augmentation to 359 validation data, $1.61{\pm}0.55%$ accuracy improvement was achieved in the case of six times augmentation (N=6). This simple method has shown that the accuracy can be improved in the N range from 2 to 6 with the correlation coefficient of 0.5667. Therefore, it is expected to help improve the diagnostic accuracy of DME with the grading information provided by the proposed DNN.

Adopting Process Management-the Importance of Recognizing the Organizational Transformation

  • Hellstrom, Andreas;Peterson, Jonas
    • International Journal of Quality Innovation
    • /
    • v.7 no.1
    • /
    • pp.20-34
    • /
    • 2006
  • The purpose of this study is to investigate what happens within an organization when a process view of the business is adopted. With the example of an empirical case, we aim to illustrate: how members of the organization make sense of process management; what contributions members of the organization consider to be the result of adopting a process view; and the relationship between the functional and the process structure. The empirical base in this study is one of Sweden's largest purchasing organizations within the public sector. The results are drawn from interviews with the process owners and a survey to all members involved in process teams. The case findings reveal an ambiguous image of process management. At the same time as process management solved specific organizational problems, it generated new dilemmas. It is argued that it is more rewarding to consider the adoption of the process view a 'social negotiation' rather than the result of planned implementation. The study also highlights that the meaning of process management is not anything given but something being created, and its negotiation and translation into organizational practice is open-ended. Furthermore, the study gives an illustration of the conflict between the adopted process view and the existing functional organization.

Automatic Mobile Screen Translation Using Object Detection Approach Based on Deep Neural Networks (심층신경망 기반의 객체 검출 방식을 활용한 모바일 화면의 자동 프로그래밍에 관한 연구)

  • Yun, Young-Sun;Park, Jisu;Jung, Jinman;Eun, Seongbae;Cha, Shin;So, Sun Sup
    • Journal of Korea Multimedia Society
    • /
    • v.21 no.11
    • /
    • pp.1305-1316
    • /
    • 2018
  • Graphical user interface(GUI) has a very important role to interact with software users. However, designing and coding of GUI are tedious and pain taking processes. In many studies, the researchers are trying to convert GUI elements or widgets to code or describe formally their structures by help of domain knowledge of stochastic methods. In this paper, we propose the GUI elements detection approach based on object detection strategy using deep neural networks(DNN). Object detection with DNN is the approach that integrates localization and classification techniques. From the experimental result, if we selected the appropriate object detection model, the results can be used for automatic code generation from the sketch or capture images. The successful GUI elements detection can describe the objects as hierarchical structures of elements and transform their information to appropriate code by object description translator that will be studied at future.