An End-to-End Sequence Learning Approach for Text Extraction and Recognition from Scene Image

  • Lalitha, G.;Lavanya, B.
    • International Journal of Computer Science & Network Security
    • /
    • v.22 no.7
    • /
    • pp.220-228
    • /
    • 2022
  • Images always carry useful information, so detecting text in scene images is imperative. The purpose of the proposed work is to recognize scene text images, for example signboard images placed along highways. Scene text detection on highway signboards plays a vital role in road safety measures. At the initial stage, preprocessing techniques are applied to sharpen and improve the features present in the image. Likewise, morphological operators are applied to the images to close the small gaps between objects. Here we propose a two-phase algorithm for extracting and recognizing text from scene images. In phase I, text is extracted from the scene image by applying various preprocessing techniques such as blurring, erosion, and top-hat filtering, followed by thresholding and the morphological gradient with fixed kernel sizes; a Canny edge detector is then applied to detect the text contained in the scene image. In phase II, the text is recognized using MSER (Maximally Stable Extremal Regions) and OCR. The proposed work aims to detect the text contained in scene images from the popular dataset repositories SVT, ICDAR 2003, and MSRA-TD 500; these images were captured under various illuminations and angles. The proposed algorithm produces higher accuracy in minimal execution time compared with state-of-the-art methodologies.
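The phase-I morphological steps this abstract describes (erosion, dilation, morphological gradient) can be sketched in plain NumPy for binary images. This is a minimal illustration, not the authors' implementation; the function names are hypothetical, and a real pipeline would typically use OpenCV's `cv2.morphologyEx` and `cv2.Canny`.

```python
import numpy as np

def erode(img, k=3):
    """Binary erosion with a k x k square structuring element."""
    pad = k // 2
    padded = np.pad(img, pad, mode="edge")
    out = np.ones_like(img)
    for dy in range(k):
        for dx in range(k):
            out &= padded[dy:dy + img.shape[0], dx:dx + img.shape[1]]
    return out

def dilate(img, k=3):
    """Binary dilation with a k x k square structuring element."""
    pad = k // 2
    padded = np.pad(img, pad, mode="edge")
    out = np.zeros_like(img)
    for dy in range(k):
        for dx in range(k):
            out |= padded[dy:dy + img.shape[0], dx:dx + img.shape[1]]
    return out

def morphological_gradient(img, k=3):
    """Dilation minus erosion; for binary images the erosion is a subset
    of the dilation, so XOR gives the boundary band (e.g. text strokes)."""
    return dilate(img, k) ^ erode(img, k)
```

For a small solid square, the gradient keeps only the boundary ring, which is exactly the kind of stroke outline the Canny stage would then trace.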

Intelligent Composition of CG and Dynamic Scene (CG와 동영상의 지적합성)

  • 박종일;정경훈;박경세;송재극
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 1995.06a
    • /
    • pp.77-81
    • /
    • 1995
  • Video composition integrates multiple image materials into one scene. It considerably enhances the degree of freedom in producing various scenes. However, for high-quality video composition, we need to adjust the viewpoints and image planes of the image materials. In this paper, we propose an intelligent video composition technique concentrating on the composition of CG and real scenes. We first model the camera system. The projection is assumed to be perspective, and the camera motion is assumed to be 3D rotational and 3D translational. Then, we automatically extract the camera parameters comprising the camera model from the real scene by a dedicated algorithm. After that, the CG scene is generated according to the camera parameters of the real scene. Finally, the two are composed into one scene. Experimental results justify the validity of the proposed method.

GMM-KL Framework for Indoor Scene Matching (실내 환경 이미지 매칭을 위한 GMM-KL프레임워크)

  • Kim, Jun-Young;Ko, Han-Seok
    • Proceedings of the KIEE Conference
    • /
    • 2005.10b
    • /
    • pp.61-63
    • /
    • 2005
  • Retrieving an indoor scene reference image from a database using visual information is an important issue in robot navigation. The scene matching problem for a navigation robot is not easy, because the input image taken during navigation is affinely distorted. We present a probabilistic framework for matching the features in the input image against the features in the database reference images, to guarantee robust scene matching. By recasting scene matching in this probabilistic framework, we obtain higher precision than the existing feature-to-feature matching scheme. To construct the probabilistic framework, we represent each image as a Gaussian Mixture Model fitted with the Expectation-Maximization algorithm over SIFT (Scale Invariant Feature Transform) features.
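The abstract does not give the exact GMM-KL distance used, but a standard building block for comparing Gaussian mixture components is the closed-form KL divergence between two multivariate Gaussians, sketched below. This is an illustration only; `gaussian_kl` is a hypothetical name, not the paper's API.

```python
import numpy as np

def gaussian_kl(mu0, cov0, mu1, cov1):
    """Closed-form KL(N0 || N1) between two multivariate Gaussians:
    0.5 * [tr(S1^-1 S0) + (m1-m0)^T S1^-1 (m1-m0) - d + ln(det S1 / det S0)]."""
    d = mu0.shape[0]
    cov1_inv = np.linalg.inv(cov1)
    diff = mu1 - mu0
    return 0.5 * (
        np.trace(cov1_inv @ cov0)
        + diff @ cov1_inv @ diff
        - d
        + np.log(np.linalg.det(cov1) / np.linalg.det(cov0))
    )
```

Summing such terms over matched components is one common way to approximate the (intractable) KL divergence between full mixtures.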

Image Scene Classification of Multiclass (다중 클래스의 이미지 장면 분류)

  • Shin, Seong-Yoon;Lee, Hyun-Chang;Shin, Kwang-Seong;Kim, Hyung-Jin;Lee, Jae-Wan
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference
    • /
    • 2021.10a
    • /
    • pp.551-552
    • /
    • 2021
  • In this paper, we present a multi-class image scene classification method based on transfer learning. Multiple classes of natural scene images are classified by relying on network models pre-trained on the large ImageNet dataset. In the experiment, we obtained excellent results by classifying Kaggle's Intel Image Classification dataset with an optimized ResNet model.

Construction Site Scene Understanding: A 2D Image Segmentation and Classification

  • Kim, Hongjo;Park, Sungjae;Ha, Sooji;Kim, Hyoungkwan
    • International conference on construction engineering and project management
    • /
    • 2015.10a
    • /
    • pp.333-335
    • /
    • 2015
  • A computer vision-based scene recognition algorithm is proposed for monitoring construction sites. The system analyzes images acquired from a surveillance camera to separate regions and classify them as building, ground, or hole. The mean shift image segmentation algorithm is tested for separating meaningful regions in construction site images. The system would benefit current monitoring practices in that the information extracted from images could capture the environmental context.
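Mean shift segmentation groups pixels by repeatedly moving each sample toward the mean of its neighborhood until samples collect at density modes. The idea can be shown in one dimension (a toy sketch with a flat kernel; `mean_shift_1d` is a hypothetical name, and real image segmentation operates jointly on spatial and color coordinates):

```python
import numpy as np

def mean_shift_1d(points, bandwidth=1.0, iters=50):
    """Move each query point to the mean of the original data points
    within `bandwidth` of it (flat kernel), until convergence."""
    shifted = points.astype(float).copy()
    for _ in range(iters):
        for i, x in enumerate(shifted):
            neighbours = points[np.abs(points - x) <= bandwidth]
            shifted[i] = neighbours.mean()
    return shifted
```

Points belonging to the same density mode end up at the same value, which is the clustering that mean shift segmentation exploits.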

Haze Scene Detection based on Hue, Saturation, and Dark Channel Distributions

  • Lee, Y.;Yang, Seungjoon
    • International Journal of Advanced Culture Technology
    • /
    • v.8 no.4
    • /
    • pp.229-234
    • /
    • 2020
  • Dehazing significantly improves image quality by restoring the contrast and color saturation lost in images taken in the presence of haze. However, when applied to images that do not match this prior, dehazing can cause unintended degradation of image quality. To avoid such degradation, we present a hazy scene detection algorithm that uses a single image, based on the distributions of hue, saturation, and the dark channel. Through a heuristic approach, we identify the statistical characteristics of these distributions in hazy scenes and build a detection model from them. The proposed method can be run before dehazing to prevent unintended degradation. The detection performance, evaluated on a set of test images, shows a high hit rate with a low false-alarm ratio. Ultimately, the proposed method can be used to control the effect of dehazing, so that dehazing can be applied to a wide variety of images without unintended degradation of image quality.
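The dark-channel cue mentioned in this abstract rests on the observation that haze-free outdoor patches usually contain some pixel that is dark in at least one color channel, while haze lifts these minima. A minimal NumPy sketch (the function name is hypothetical; production code would use an efficient minimum filter):

```python
import numpy as np

def dark_channel(img, patch=3):
    """Dark channel of an H x W x 3 image: per-pixel minimum over the
    color channels, followed by a minimum filter over a patch window."""
    min_rgb = img.min(axis=2)
    pad = patch // 2
    padded = np.pad(min_rgb, pad, mode="edge")
    h, w = min_rgb.shape
    out = np.empty_like(min_rgb)
    for y in range(h):
        for x in range(w):
            out[y, x] = padded[y:y + patch, x:x + patch].min()
    return out
```

A saturated haze-free region (e.g. pure red) yields a dark channel near zero, while a washed-out hazy region yields uniformly high values, which is the statistic the detection model thresholds.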

Absolute Depth Estimation Based on a Sharpness-assessment Algorithm for a Camera with an Asymmetric Aperture

  • Kim, Beomjun;Heo, Daerak;Moon, Woonchan;Hahn, Joonku
    • Current Optics and Photonics
    • /
    • v.5 no.5
    • /
    • pp.514-523
    • /
    • 2021
  • Methods for absolute depth estimation have received considerable interest, and most algorithms focus on minimizing the difference between an input defocused image and an estimated defocused image. These approaches may increase the complexity of the algorithms, since the defocused image must be calculated from an estimate of the focused image. In this paper, we present a new method to recover the depth of a scene based on a sharpness-assessment algorithm. The proposed algorithm estimates the depth of the scene by calculating the sharpness of images deconvolved with a specific point-spread function (PSF). While most depth-estimation studies evaluate the depth of the scene only behind the focal plane, the proposed method evaluates a broad depth range both nearer and farther than the focal plane. This is accomplished using an asymmetric aperture, so the PSF at a position nearer than the focal plane differs from that at a position farther than the focal plane. From an image taken with a focal plane at 160 cm, the depth of objects over the broad range from 60 to 350 cm is estimated at 10 cm resolution. With an asymmetric aperture, we demonstrate the feasibility of the sharpness-assessment algorithm for recovering the absolute depth of a scene from a single defocused image.
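The abstract does not specify the sharpness measure, so purely as an illustration, one widely used sharpness assessment is the variance of the image Laplacian, which such a pipeline could apply to each candidate image deconvolved with a depth-dependent PSF and then pick the depth giving the sharpest result:

```python
import numpy as np

def laplacian_sharpness(img):
    """Variance of the 4-neighbour Laplacian of a 2D image.
    Sharp images have strong second derivatives, so higher = sharper."""
    lap = (-4.0 * img[1:-1, 1:-1]
           + img[:-2, 1:-1] + img[2:, 1:-1]
           + img[1:-1, :-2] + img[1:-1, 2:])
    return lap.var()
```

A crisp step edge scores higher than the same edge after smoothing, which is the ranking a sharpness-based depth search relies on.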

An Effective Background Subtraction in Dynamic Scenes (동적 환경에서의 효과적인 움직이는 객체 추출)

  • Han, Jae-Hyek;Kim, Yong-Jin;Ryu, Sae-Woon;Lee, Sang-Hwa;Park, Jong-Il
    • Proceedings of the HCI Society of Korea Conference
    • /
    • 2009.02a
    • /
    • pp.631-636
    • /
    • 2009
  • Foreground segmentation methods have steadily been researched in the field of computer vision. In particular, background subtraction, which extracts a foreground image from the difference between the current frame and a reference image called the "background image", has been widely used in a variety of real-time applications because of its low computation and high quality. However, if the background scene changes dynamically, background subtraction causes many errors. In this paper, we propose an efficient background subtraction method for dynamic environments containing both static and dynamic scenes. The proposed method is a hybrid that uses conventional background subtraction for the static scene and depth information for the dynamic scene. Its validity and efficiency are verified by a demonstration in a dynamic environment in which a video projector projects various images onto the background.
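The hybrid idea described here, intensity differencing where the background is static plus a depth cue where it changes, can be sketched as follows. This is a hedged NumPy illustration; the function name, thresholds, and the "closer than the background plane" depth rule are assumptions, not the authors' parameters.

```python
import numpy as np

def subtract_background(frame, background, depth, thresh=0.1, depth_near=1.0):
    """Hybrid foreground mask.
    - diff_mask: classic background subtraction, valid where the
      background is static.
    - depth_mask: depth cue, robust where the background itself changes
      (e.g. projected images): foreground objects sit closer to the camera.
    """
    diff_mask = np.abs(frame - background) > thresh
    depth_mask = depth < depth_near
    return diff_mask | depth_mask
```

A pixel is labeled foreground if either cue fires, so projector-induced intensity changes alone do not create false positives when the depth cue is used for the dynamic region.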

Enhancing Depth Accuracy on the Region of Interest in a Scene for Depth Image Based Rendering

  • Cho, Yongjoo;Seo, Kiyoung;Park, Kyoung Shin
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.8 no.7
    • /
    • pp.2434-2448
    • /
    • 2014
  • This research proposed a domain-division depth-map quantization for multiview intermediate image generation using Depth Image-Based Rendering (DIBR). The technique quantizes depth per pixel according to the percentage of depth bits assigned to each domain of the depth range. A comparative experiment was conducted to investigate the potential benefits of the proposed method against linear depth quantization for DIBR multiview intermediate image generation. The experiment evaluated three quantization methods on computer-generated 3D scenes of various complexities and backgrounds while varying the depth resolution. The results showed that the proposed domain-division depth quantization method outperformed the linear method on 7-bit or lower depth maps, especially in scenes with large objects.
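Domain-division quantization assigns a larger share of the available depth levels to a chosen domain (e.g. the near range containing the region of interest) than uniform quantization would. A minimal NumPy sketch under assumed parameters; the split point and level share are illustrative, not the paper's values:

```python
import numpy as np

def linear_quantize(depth, bits=7):
    """Uniform quantization of normalized depth values in [0, 1]."""
    levels = 2 ** bits - 1
    return np.round(depth * levels) / levels

def domain_quantize(depth, bits=7, split=0.5, near_share=0.75):
    """Domain-division quantization: give `near_share` of the levels to
    the near domain [0, split), the remainder to [split, 1]."""
    levels = 2 ** bits - 1
    near_levels = int(levels * near_share)
    far_levels = levels - near_levels
    out = np.empty_like(depth, dtype=float)
    near = depth < split
    out[near] = (np.round(depth[near] / split * near_levels)
                 / near_levels * split)
    out[~near] = split + (np.round((depth[~near] - split) / (1 - split)
                                   * far_levels) / far_levels * (1 - split))
    return out
```

At the same bit budget, the near domain gets a finer step, so its maximum quantization error drops below that of the linear scheme, at the cost of coarser far-range depth.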

Hypergraph model based Scene Image Classification Method (하이퍼그래프 모델 기반의 장면 이미지 분류 기법)

  • Choi, Sun-Wook;Lee, Chong Ho
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.24 no.2
    • /
    • pp.166-172
    • /
    • 2014
  • Image classification is an important problem in computer vision. However, it is very challenging due to the variability, ambiguity, and scale changes that exist in images. In this paper, we propose a hypergraph-based modeling method that can consider the higher-order relationships among the semantic attributes of a scene image, and we apply it to scene image classification. To generate a hypergraph optimized for a specific scene category, we propose a novel search method based on a probabilistic subspace method, together with a method to aggregate the expression values of the member semantic attributes belonging to the searched subsets, based on a linear transformation via likelihood-based estimation. To verify the superiority of the proposed method, we show through experiments that the discriminative power of the feature vector generated by the proposed method is better than that of existing methods. In a scene classification experiment, the proposed method also shows competitive classification performance compared with conventional methods.