• Title/Summary/Keyword: Background distortion

Search Result 104, Processing Time 0.022 seconds

Recent Trends in Deep Learning-Based Optical Character Recognition (딥러닝 기반 광학 문자 인식 기술 동향)

  • Min, G.;Lee, A.;Kim, K.S.;Kim, J.E.;Kang, H.S.;Lee, G.H.
    • Electronics and Telecommunications Trends
    • /
    • v.37 no.5
    • /
    • pp.22-32
    • /
    • 2022
  • Optical character recognition is a primary technology required in different fields, including digitizing archival documents, industrial automation, automatic driving, video analytics, medicine, and financial institution, among others. It was created in 1928 using pattern matching, but with the advent of artificial intelligence, it has since evolved into a high-performance character recognition technology. Recently, methods for detecting curved text and characters existing in a complicated background are being studied. Additionally, deep learning models are being developed in a way to recognize texts in various orientations and resolutions, perspective distortion, illumination reflection and partially occluded text, complex font characters, and special characters and artistic text among others. This report reviews the recent deep learning-based text detection and recognition methods and their various applications.

Testing LCDM with eBOSS / SDSS

  • Keeley, Ryan E.;Shafieloo, Arman;Zhao, Gong-bo;Koo, Hanwool
    • The Bulletin of The Korean Astronomical Society
    • /
    • v.46 no.1
    • /
    • pp.47.3-47.3
    • /
    • 2021
  • In this talk I will review recent progress that the SDSS-IV / eBOSS collaboration has made in constraining cosmology from the clustering of galaxies, quasars and the Lyman-alpha forest. The SDSS-IV / eBOSS collaboration has measured the baryon acoustic oscillation (BAO) and redshift space distortion (RSD) features in the correlation function in redshift bins from z~0.15 to z~2.33. These features constitute measurements of angular diameter distances, Hubble distances, and growth rate measurements. A number of consistency tests have been performed between the BAO and RSD datasets and additional cosmological datasets such as the Planck cosmic microwave background constraints, the Pantheon Type Ia supernova compilation, and the weak lensing results from the Dark Energy Survey. Taken together, these joint constraints all point to a broad consistency with the standard model of cosmology LCDM + GR, though they remain in tension with local measurements of the Hubble parameter.

  • PDF

A Study on the Surrealism Expressed in Fashion Illustrations (패션일러스트레이션에 나타난 초현실주의 표현에 관한 연구)

  • Kim Soo-Kyong
    • Journal of the Korea Fashion and Costume Design Association
    • /
    • v.6 no.3
    • /
    • pp.123-138
    • /
    • 2004
  • The objective of this study is to review the expressive effect of fashion illustrations and to enlarge its expressive range by analysing the correlations between fashion illustrations and surrealistic expressive characteristics. The results are as follows. The surrealistic expressive methods of fashion illustration such as depaysement and automatism have been used by many foreign and domestic fashion illustrators. I think that these methods reinforce the subjective aspect of expressive effects and enlarge the transmitting function of fashion message. The elements of expression are divided into color, texture, shape/form and space. Subjective colors rather than natural colors are used for the surrealistic expressive effect of color in fashion illustrations, and illusion of space is expressed by the change of brightness. The surrealistic expressive effect in fashion illustrations distorted the experience of factual texture and disturbed the visual texture. The use of tactile texture presented strange effects perspectively. The objects of shape in fashion illustrations were mainly human body. The form of human body presents reconstituted and reinterpreted figure by transformation or distortion according to the intention of the artist rather than realistic and physical form. In fashion illustration, the transformation of the human body itself whose body parts get separated, superposition of exterior material and human body, and the distortion of human body by the change are done. Space is where background and shapes are inter-penetrated because they are ambiguous and untransparent, and where all are in disorder and unbalance. The method of projection uses distance perspective, multiple perspective and amplified perspective representation, and forms a psychological space feeling. And I think the use of surrealistic expressive methods are very useful for the subjective sensitiveness and the transmission of a fashion message, and presents a possibility to develop a new expressive method.

  • PDF

An Exploratory Study on the Establishment and Provision of Universal Literacy for Sustainable Development in the Era of Fake News (가짜뉴스의 시대, 지속가능한 발전을 위한 보편적 리터러시의 구축 및 제공에 대한 실험적 연구)

  • Lee, Jeong-Mee
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.55 no.1
    • /
    • pp.85-106
    • /
    • 2021
  • The purpose of this study is to examine the concept and definition of fake news focusing on misinformation/false information and is to examine the ways in which our society can respond to the distortion of social reality and damage to democracy caused by information distortion such as fake news. To do this, the concept of fake news was examined based on the level of facticity and intention to device, and our social environment in which fake news was created and spread was examined from the perspective of datafication. In this environment, the library community, which plays a pivotal role in human access to and use of information, argued that it should strive to establish and provide universal literacy education in order to realize the Sustainable Development Goals of the UN 2030 agenda. The core of universal literacy education is to understand the society by investigating and analyzing data communication types according to the degree of datafication and the political, economic, social, and cultural background of society. For this reason, it was concluded that universal literacy should be implemented flexibly according to the degree of datafiation and users of each society.

A Korean CAPTCHA Study: Defeating OCRs In a New CAPTCHA Context By Using Korean Syllables

  • Yang, Tae-Cheon;Ince, Ibrahim Furkan;Salman, Yucel Datu
    • International Journal of Contents
    • /
    • v.5 no.3
    • /
    • pp.50-56
    • /
    • 2009
  • Internet is being used for several activities by a great range of users. These activities include communication, e-commerce, education, and entertainment. Users are required to register regarding website in order to enroll web activities. However, registration can be done by automated hacking software. That software make false enrollments which occupy the resources of the website by reducing the performance and efficiency of servers, even stop the entire web service. It is crucial for the websites to have a system which has the capability of differing human users and computer programs in reading images of text. Completely Automated Public Turing Test to Tell Computers and Human Apart (CAPTCHA) is such a defense system against Optical Character Recognition (OCR) software. OCR can be defined as software which work for defeating CAPTCHA images and make countless number of registrations on the websites. This study proposes a new CAPTCHA context that is Korean CAPTCHA by means of the method which is splitting CAPTCHA images into several parts with random rotation values, and drawing random lines on a grid background by using Korean characters only. Lines are in the same color with the CAPTCHA text and they provide a distortion of image with grid background. Experimental results show that Korean CAPTCHA is a more secure and effective CAPTCHA type for Korean users rather than current CAPTCHA types due to the structure of Korean letters and the algorithm we are using: rotation and splitting. In this paper, the algorithm of our method is introduced in detail.

Speech Spectrum Enhancement Combined with Frequency-weighted Spectrum Shaping Filter and Wiener Filter (주파수가중 스펙트럼성형필터와 위너필터를 결합한 음성 스펙트럼 강조)

  • Choi, Jae-Seung
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.20 no.10
    • /
    • pp.1867-1872
    • /
    • 2016
  • In the area of digital signal processing, it is necessary to improve the quality of the speech signal after removing the background noise which exists in a various real environments. The important thing to consider when removing the background noise acoustically is that to solve the problem, depending on the information of the human auditory mechanism is mainly the amplitude spectrum of the speech signal. This paper introduces the characteristics of a frequency-weighted spectrum shaping filter for the extraction of the amplitude spectrum of the speech signal with the primary purpose. Therefore, this paper proposes an algorithm using the methods of a Wiener filter and the frequency-weighted spectrum shaping filter according to the acoustic model, after extracted the amplitude spectral information in the noisy speech signal. The spectral distortion (SD) output of the proposed algorithm is experimentally improved more than 5.28 dB compared to a conventional method.

An Application of the Kalman Filter for Attenuation of Colored Noise Superimposed on Speech Signal (칼만필터를 이용한 음성신호에 중첩된 유색잡음의 감쇠)

  • Gu, Bon-Eung
    • The Journal of the Acoustical Society of Korea
    • /
    • v.13 no.2
    • /
    • pp.76-85
    • /
    • 1994
  • A speech enhancement algorithm which attenuates nonstationary colored noise is presented In this paper. The algorithm consists of a stationary Kalman filter and the simple speech/nonspeech detector. While the conventional enhancement systems are focused on a stationary and/or white background noise, this study Is focused on the mort realistic nonstationary and nonwhite noise. An AR model-based vector Kalman filter is used as a noise suppression system and a short-time energy threshold logic is used as a speech/nonspeech classifier. For Kalman filtering. noise coefficients are estimated in the nonspeech frame, and speech coefficients are estimated by applying the EM iteration algorithm. Simulation results using the car noise are presented based on the signal-to-noise ratio and informal listening tests. According to the experimental results, background noises in the nonspeech frames are eliminated almost completely, while some distortions are noticed in the speech frames. The distortion becomes severer as the SNR is reduced to 0dB and -5dB. Intelligibility, however, is not degraded significantly.

  • PDF

Raindrop Removal and Background Information Recovery in Coastal Wave Video Imagery using Generative Adversarial Networks (적대적생성신경망을 이용한 연안 파랑 비디오 영상에서의 빗방울 제거 및 배경 정보 복원)

  • Huh, Dong;Kim, Jaeil;Kim, Jinah
    • Journal of the Korea Computer Graphics Society
    • /
    • v.25 no.5
    • /
    • pp.1-9
    • /
    • 2019
  • In this paper, we propose a video enhancement method using generative adversarial networks to remove raindrops and restore the background information on the removed region in the coastal wave video imagery distorted by raindrops during rainfall. Two experimental models are implemented: Pix2Pix network widely used for image-to-image translation and Attentive GAN, which is currently performing well for raindrop removal on a single images. The models are trained with a public dataset of paired natural images with and without raindrops and the trained models are evaluated their performance of raindrop removal and background information recovery of rainwater distortion of coastal wave video imagery. In order to improve the performance, we have acquired paired video dataset with and without raindrops at the real coast and conducted transfer learning to the pre-trained models with those new dataset. The performance of fine-tuned models is improved by comparing the results from pre-trained models. The performance is evaluated using the peak signal-to-noise ratio and structural similarity index and the fine-tuned Pix2Pix network by transfer learning shows the best performance to reconstruct distorted coastal wave video imagery by raindrops.

Design of Visual Quantizer for very low Bit-rate Coding on JPEG2000 (JPEG2000에서 저 전송 부호화를 위한 비주얼 양자화기 설계)

  • Kim, Dong-Hyeok;Jeon, Joon-Hyeon
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.47 no.4
    • /
    • pp.69-78
    • /
    • 2010
  • The irreversible 9/7 JPEG2000, which is one of sub-band coding techniques, has a problem of severe picture quality distortion at the edge and the background caused by the quantization error below 0.15bpp. In this paper, to solve such problems we propose a VQ(Visual Quantizer) based on L-pdf(Laplace probability density function) statistical characteristics of high frequency sub-bands. The proposed VQ is designed by visual parameter for improving the subjective quality and weighting parameter for increasing the compression ratio. A proposed method, based on 9/7 JPEG2000 scheme, gives the high subjective quality to reconstructed images below 0.15bpp and provides minimum MSE(Mean-Squared Error) regardless of the compression ratio.

Depth Map Coding Using Histogram-Based Segmentation and Depth Range Updating

  • Lin, Chunyu;Zhao, Yao;Xiao, Jimin;Tillo, Tammam
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.9 no.3
    • /
    • pp.1121-1139
    • /
    • 2015
  • In texture-plus-depth format, depth map compression is an important task. Different from normal texture images, depth maps have less texture information, while contain many homogeneous regions separated by sharp edges. This feature will be employed to form an efficient depth map coding scheme in this paper. Firstly, the histogram of the depth map will be analyzed to find an appropriate threshold that segments the depth map into the foreground and background regions, allowing the edge between these two kinds of regions to be obtained. Secondly, the two regions will be encoded through rate distortion optimization with a shape adaptive wavelet transform, while the edges are lossless encoded with JBIG2. Finally, a depth-updating algorithm based on the threshold and the depth range is applied to enhance the quality of the decoded depth maps. Experimental results demonstrate the effective performance on both the depth map quality and the synthesized view quality.