• Title/Summary/Keyword: Image-to-image Translation

Search Result 306, Processing Time 0.035 seconds

A Study on Analysis of Variant Factors of Recognition Performance for Lip-reading at Dynamic Environment (동적 환경에서의 립리딩 인식성능저하 요인분석에 대한 연구)

  • 신도성;김진영;이주헌
    • The Journal of the Acoustical Society of Korea
    • /
    • v.21 no.5
    • /
    • pp.471-477
    • /
    • 2002
  • Recently, lip-reading has been studied actively as an auxiliary method of automatic speech recognition(ASR) in noisy environments. However, almost of research results were obtained based on the database constructed in indoor condition. So, we dont know how developed lip-reading algorithms are robust to dynamic variation of image. Currently we have developed a lip-reading system based on image-transform based algorithm. This system recognize 22 words and this word recognizer achieves word recognition of up to 53.54%. In this paper we present how stable the lip-reading system is in environmental variance and what the main variant factors are about dropping off in word-recognition performance. For studying lip-reading robustness we consider spatial valiance (translation, rotation, scaling) and illumination variance. Two kinds of test data are used. One Is the simulated lip image database and the other is real dynamic database captured in car environment. As a result of our experiment, we show that the spatial variance is one of degradations factors of lip reading performance. But the most important factor of degradation is not the spatial variance. The illumination variances make severe reduction of recognition rates as much as 70%. In conclusion, robust lip reading algorithms against illumination variances should be developed for using lip reading as a complementary method of ASR.

Analysis on Setup Variation According to Megavoltage Computed Tomography System

  • Kim, Sun-Yung;Kim, Hwa-Sun;Lee, Hae-Kag
    • Journal of Magnetics
    • /
    • v.21 no.3
    • /
    • pp.425-430
    • /
    • 2016
  • The aim of this study was to measure the setup variation for X (lateral), Y (longitude), and Z (vertical) by taking magnetic megavoltage computed tomography (MVCT) before treating the brain, oropharynx, lung, and prostate patients on helical tomotherapy. In this study, 30 patients were chosen for each of the treatment areas, and their skin was labeled with a mark on a treatment planning reference point when taking CT. We preceded MVCT prior to tomotherapy and then conducted an auto registration based on the bony landmarks; image registration was used for automatically matching the patient's setup. Lastly, we confirmed and evaluated the translation coordinates of the images for 30 patients. The following shows the comparison result of the setup errors of each part: X (lateral) showed the highest setup errors with $3.44{\pm}2.05$ from Lung; Y (longitude) showed the highest setup errors showing $3.40{\pm}2.87mm$ from Prostate; and Z (vertical) showed the highest setup errors showing $6.62{\pm}4.38mm$ from Lung. This result verifies that the setup error can be prevented by taking MVCT before the treatment, and Planning Target Volume (PTV) margins can be reduced by referring to the resulting value of each treatment part. Ultimately, the dosage of the normal organs can be decreased as well as any side effects.

Noncontact 3-dimensional measurement using He-Ne laser and CCD camera (He-Ne 레이저와 CCD 카메라를 이용한 비접촉 3차원 측정)

  • Kim, Bong-chae;Jeon, Byung-cheol;Kim, Jae-do
    • Transactions of the Korean Society of Mechanical Engineers A
    • /
    • v.21 no.11
    • /
    • pp.1862-1870
    • /
    • 1997
  • A fast and precise technique to measure 3-dimensional coordinates of an object is proposed. It is essential to take the 3-dimensional measurements of the object in design and inspection. Using this developed system a surface model of a complex shape can be constructed. 3-dimensional world coordinates are projected onto a camera plane by the perspective transformation, which plays an important role in this measurement system. According to the shape of the object two measuring methods are proposed. One is rotation of an object and the other is translation of measuring unit. Measuring speed depending on image processing time is obtained as 200 points per second. Measurement resolution i sexperimented by two parameters among others; the angle between the laser beam plane and the camera, and the distance between the camera and the object. As a result of these experiments, it was found that measurement resolution ranges from 0.3mm to 1.0mm. This constructed surface model could be used in manufacturing tools such as rapid prototyping machine.

RECTIFICATION OF PURE TRANSLATION 2D CAMERA ARRAY

  • Ota, Makoto;Fukushima, Norishige;Yendo, Tomohiro;Tanimoto, Masayuki;Fujii, Toshiaki
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2009.01a
    • /
    • pp.659-663
    • /
    • 2009
  • In this paper, we propose a rectification method that can convert ray space data obtained by controlled camera array to ideal data. Here, Ideal data is obtained by getting longitudinal and transversal epipolar line between cameras vertical and horizontal. However it is actually difficult to arrange cameras strictly because we arrange cameras by hand. As conventional method, we have use camera-calibration method. But if we use this method there are some errors on the output image. When we generate arbitrary viewpoint images this error is critical problem. We focus attention on ideal trajectory of characteristic point. And to minimize the error directly we parallelize the real one. And we showed usefulness of proposed technique. Then using the proposed technique, we were successful reducing the error to less than 0.5 pixels.

  • PDF

ANALYSIS OF REIGN STYLE AND CALENDAR DAY PRESENTED IN THE EPIGRAPHS OF THE GORYEO DYNASTY (고려시대 금석문에 나타난 연호와 역일 기록 분석)

  • LEE, KI-WON;AHN, YOUNG SOOK;MIHN, BYEONG-HEE
    • Publications of The Korean Astronomical Society
    • /
    • v.31 no.1
    • /
    • pp.1-9
    • /
    • 2016
  • We investigate the records related to the reign style and the calendar day from the epigraphs of the Goryeo dynasty (918 - 1392) in Korea in order to verify and supplement the sexagenary cycle data of the first day in the lunar month of the dynasty. The database of the National Research Institute of Culture Heritage contains a rubbed-copy image, transcription statement, and translation statement for Korean epigraphs as well as 775 epigraphs corresponding to the Goryeo dynasty. The epigraph records are valuable in that, during this time, they were written differently from other historical literature such as the Goryeosa (History of the Goryeo Dynasty), which was compiled in the next dynasty. We find that the Goryeo dynasty, in general, had adopted the reign styles of Chinese dynasties at that time. We also find 159 calendar day records all showing good agreement with the work of Ahn et al. except for dozens of records. Through this study, we can verify the reign styles and the calendar days of the Goryeo dynasty.

Shape Recognition Using Skeleton Image Based on Mathematical Morphology (수리형태론적 스켈리턴 영상을 이용한 형상인식)

  • Jang, Ju-Seok;Son, Yun-Gu
    • The Transactions of the Korea Information Processing Society
    • /
    • v.3 no.4
    • /
    • pp.883-898
    • /
    • 1996
  • In this paper, we propose improved method to recognize the shape for enhancing the quality of the pattern recognition system by compressing the source images. In the proposed method, we reduced the data amount by skeletonizing the source images using mathematical morphology, and then matched patterns after accomplishing the translation and scale normalization, and rotation invariance on the transformed images. Through the scale normalization, it was possible for the shape recognition at minimum amount of the pixel by giving the weight to the skeleton pixel. As the source images was replaced by the skeleton images, it was possible to reduce the amount of data and computational loads dramatically, and so become much faster even with a smaller memory capacity. Through the experiment, we investigated the optimum scale factor and good result was proved when realizing the pattern recognition system.

  • PDF

Object Recognition by Fourier Descriptor (푸리에 서술자를 이용한 물체 인식)

  • O, Chun-Seok;Park, Yong-Beom
    • The Transactions of the Korea Information Processing Society
    • /
    • v.1 no.1
    • /
    • pp.73-80
    • /
    • 1994
  • Fourier Descriptors(FD) is a common way for representing the boundary of an object. In this paper, an algorithm has been implemented to do object recognition by using FD. This is applied to various tool object, and is tested. This implementation contains two parts: image acquisition and object recognition. Appropriate lighting, viewing angle, and strong contrast of background and object are taken into account in this aspect. Minimum distances are calculated by using FD's and boundary matching among objects on the process of object recognition. Rotation, translation and scaling of the object will not influence the performance of the algorithm. Experiments show that we can use only one fourth of 1024 FD coefficients to do raped object recognition.

  • PDF

Validation of Korean Version of the Social Appearance Anxiety Scale (한국판 사회적 외모불안 척도(Korean Version of the Social Appearance Anxiety Scale, K-SAAS) 타당화)

  • Minji Lee;Mirihae Kim;Jung-Ho Kim
    • Anxiety and mood
    • /
    • v.19 no.1
    • /
    • pp.1-9
    • /
    • 2023
  • Objective : To translate and adapt the Social Appearance Anxiety Scale into Korean and validate the Korean version of the social appearance anxiety scale, which measures the fear and anxiety about being negatively evaluated by others based on one's overall appearance, including body shape. Methods : For item translation and adaptation, six bilingual translators participated in the process of forward-adaptation and back-adaptation. Data were collected from undergraduate students. The sample size is 105 for Study 1 and 212 for Study 2. Classical item discrimination and difficulty analyses, exploratory factor analysis (EFA), confirmatory factor analysis (CFA), and reliability analysis were performed. Results : A unidimensional structure was found with a high internal consistency (Cronbach's α=0.95) and a high test-retest reliability (r=0.918). In addition, the concurrent validity was examined by correlations of the scale and several other scales measuring constructs related to social appearance anxiety. Conclusion : K-SAAS appears to be a reliable and valid scale for screening and assessing social appearance anxiety.

A Study on the Diversity of Shanghan(傷寒) Concept in Gangpyeong-Sanghanlun(康平傷寒論) (『강평상한론(康平傷寒論)』 내 '상한(傷寒)' 개념의 다양성에 대한 고찰)

  • Lee, Soong-In;Jeong, Jong-Kil
    • Journal of Korean Medical classics
    • /
    • v.28 no.1
    • /
    • pp.97-110
    • /
    • 2015
  • Objectives : Usually medical terminology of oriental mecidine has a multiple meaning. But concept of Shanghan(傷寒) should be simple, because Shanghanlun(傷寒論) is a clinical guideline book. So I researched to suggest many concept of Shanghan, which are suitable for each chapter of Shanghanlun. Methods : I enumerated provisions including Shnaghan from the original texts of Gangpyeong-Shanghanlun(康平傷寒論). And I translated and reviewed them. Results : 1. Shanghan of Preface(序文) means a disease of high fatality. 2. Shanghan of Shanghanrye(傷寒例) means diseases due to physical damage of cold weather. 3. Shanghan of Diagnosis of Daeyang Disease(辨大陽病) - Neck stiffness(痙), Dampness(濕), Sun stroke(暍) means certain disease names accompanying fever, chill. 4. Shanghan used in Diagnosis of Diseases is a premise of many provisions of Shanghanlun. And Shanghan is made up of finished fever, expected fever, chill, body pain, loss of appetite, image of tension. Conclusions : We can use a appropriate translation on Shanghan of each chapter of Gangpyeong-Sanghanlun. Especially Shanghan used in "Diagnosis of Diseases" should have more accurate meaning.

An Improved 2-D Moment Algorithm for Pattern Classification

  • Yoon, myoung-Young
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.4 no.2
    • /
    • pp.1-6
    • /
    • 1999
  • We propose a new algorithm for pattern classification by extracting feature vectors based on Gibbs distributions which are well suited for representing the characteristic of an images. The extracted feature vectors are comprised of 2-D moments which are invariant under translation rotation, and scale of the image less sensitive to noise. This implementation contains two puts: feature extraction and pattern classification First of all, we extract feature vector which consists of an improved 2-D moments on the basis of estimated Gibbs distribution Next, in the classification phase the minimization of the discrimination cost function for a specific pattern determines the corresponding template pattern. In order to evaluate the performance of the proposed scheme, classification experiments with training document sets of characters have been carried out on SUN ULTRA 10 Workstation Experiment results reveal that the proposed scheme had high classification rate over 98%.

  • PDF