• 제목/요약/키워드: Scene Recognition

검색결과 193건 처리시간 0.029초

Detecting and Segmenting Text from Images for a Mobile Translator System

  • Chalidabhongse, Thanarat H.;Jeeraboon, Poonsak
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 2004년도 ICCAS
    • /
    • pp.875-878
    • /
    • 2004
  • Researching in text detection and segmentation has been done for a long period in the OCR area. However, there is some other area that the text detection and segmentation from images can be very useful. In this report, we first propose the design of a mobile translator system which helps non-native speakers to understand the foreign language using ubiquitous mobile network and camera mobile phones. The main focus of the paper will be the algorithm in detecting and segmenting texts embedded in the natural scenes from taken images. The image, which is captured by a camera mobile phone, is transmitted to a translator server. It is initially passed through some preprocessing processes to smooth the image as well as suppress noises. A threshold is applied to binarize the image. Afterward, an edge detection algorithm and connected component analysis are performed on the filtered image to find edges and segment the components in the image. Finally, the pre-defined layout relation constraints are utilized in order to decide which components likely to be texts in the image. A preliminary experiment was done and the system yielded a recognition rate of 94.44% on a set of 36 various natural scene images that contain texts.

  • PDF

Automatic Display of an Additional Explanation on a Keyword Written by a Lecturer for e-Learning Using a Pen Capture Tool on Whiteboard and Two Cameras

  • Nishikimi, Kazuyuki;Yada, Yuuki;Tsuruoka, Shinji;Yoshikawa, Tomohiro;Shinogi, Tsuyoshi
    • 한국지능시스템학회:학술대회논문집
    • /
    • 한국퍼지및지능시스템학회 2003년도 ISIS 2003
    • /
    • pp.102-105
    • /
    • 2003
  • "e-Leaning" system is classified by lecture time into two types, that is, "synchronous type" spent the same lecture time between the lecturer and students, and "asynchronous type" spent the different lecture time. The size of image database is huge, and there are some problem on the management of the lecture image database in "asynchronous type" e-Learning system. The one of them is that the time tag for the database management must be added manually at present, and the cost of the addition of the time tag causes a serious problem. To resolve the problem, we will use the character recognition for the characters written by the lecturer on whiteboard, and will add the recognized character as a keyword to the tag of the image database. If the database would have the keyword, we could retrieve the database by the keyword efficiently, and the student could select the interested lecture scene only in the full lecture database.

  • PDF

Underwater 3D Reconstruction for Underwater Construction Robot Based on 2D Multibeam Imaging Sonar

  • Song, Young-eun;Choi, Seung-Joon
    • 한국해양공학회지
    • /
    • 제30권3호
    • /
    • pp.227-233
    • /
    • 2016
  • This paper presents an underwater structure 3D reconstruction method using a 2D multibeam imaging sonar. Compared with other underwater environmental recognition sensors, the 2D multibeam imaging sonar offers high resolution images in water with a high turbidity level by showing the reflection intensity data in real-time. With such advantages, almost all underwater applications, including ROVs, have applied this 2D multibeam imaging sonar. However, the elevation data are missing in sonar images, which causes difficulties with correctly understanding the underwater topography. To solve this problem, this paper concentrates on the physical relationship between the sonar image and the scene topography to find the elevation information. First, the modeling of the sonar reflection intensity data is studied using the distances and angles of the sonar beams and underwater objects. Second, the elevation data are determined based on parameters like the reflection intensity and shadow length. Then, the elevation information is applied to the 3D underwater reconstruction. This paper evaluates the presented real-time 3D reconstruction method using real underwater environments. Experimental results are shown to appraise the performance of the method. Additionally, with the utilization of ROVs, the contour and texture image mapping results from the obtained 3D reconstruction results are presented as applications.

시간과 빛의 변화에 따른 자연색채 감성의 변화연구 (The Changes of Color Emotions According to the Time flow and Natural Environmental Color Changes)

  • 이정안;이연주
    • 한국실내디자인학회:학술대회논문집
    • /
    • 한국실내디자인학회 2006년도 추계학술발표대회 논문집
    • /
    • pp.121-124
    • /
    • 2006
  • This study was done with a goal to observe changes of color emotions according to time flow and light changes as well as to study its moaning of color experience of natural scene to modern city dwellers with artificial surroundings. Individuals develop various feelings after seeing a color, but there is sometimes a common feeling raised among these various feelings. This study aims to investigate the influence of natural colors of surroundings on the emotions of human beings. First of all, we tried to discover how feelings change after a person is reminded of a color through an experience (recognition) of a natural color. Second, differences in feelings resulting from color perception are analyzed after time passes (sunrise, daytime, and sunset) and the colors of natural surroundings change accordingly. Survey was done in the period of $Jul.8^{th}$ to $10^{th}$, 2005 with 100 people (55 male and 45 female) in various professions and various ages between twenties to forties as respondent.

  • PDF

ZigBee 토폴로지를 이용한 스마트 홈 네트워크 시스템 설계 (Design of Smart Home Network System based on ZigBee Topology)

  • 유단;김광준;이진우
    • 한국전자통신학회논문지
    • /
    • 제7권3호
    • /
    • pp.537-543
    • /
    • 2012
  • 스마트 홈 시스템은 종합적인 네트워크 지능 홈 제어 시스템에서 실제적이며, 자동제어 시스템, 컴퓨터 네트워크 시스템과 네트워크 통신 기술이다. 지능적인 홈 시스템은 사용자로 하여금 가옥, 무선 원격 제어, 터치스크린 전화, 인터넷 또는 음성 인식 제어 가정용 장치를 화면 조작 또는 장치들을 연결함으로서 보다 편리하게 해줄 수 있다. 본 논문에서는 상호간의 서로 다른 상태의 동작에 따른 사용자 요구가 필요가 없는 상호간의 통신이 가능한 다양한 종류의 지능적인 가정용 장치를 구현함으로서 사용자가 대단히 효율적이고 편리하며 안전하도록 설계하였다.

A Knowledge-Based Machine Vision System for Automated Industrial Web Inspection

  • Cho, Tai-Hoon;Jung, Young-Kee;Cho, Hyun-Chan
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • 제1권1호
    • /
    • pp.13-23
    • /
    • 2001
  • Most current machine vision systems for industrial inspection were developed with one specific task in mind. Hence, these systems are inflexible in the sense that they cannot easily be adapted to other applications. In this paper, a general vision system framework has been developed that can be easily adapted to a variety of industrial web inspection problems. The objective of this system is to automatically locate and identify \\\"defects\\\" on the surface of the material being inspected. This framework is designed to be robust, to be flexible, and to be as computationally simple as possible. To assure robustness this framework employs a combined strategy of top-down and bottom-up control, hierarchical defect models, and uncertain reasoning methods. To make this framework flexible, a modular Blackboard framework is employed. To minimize computational complexity the system incorporates a simple multi-thresholding segmentation scheme, a fuzzy logic focus of attention mechanism for scene analysis operations, and a partitioning if knowledge that allows concurrent parallel processing during recognition.cognition.

  • PDF

차량 번호판 인식을 위한 앙상블 학습기 기반의 최적 특징 선택 방법 (An Ensemble Classifier Based Method to Select Optimal Image Features for License Plate Recognition)

  • 조재호;강동중
    • 전기학회논문지
    • /
    • 제65권1호
    • /
    • pp.142-149
    • /
    • 2016
  • This paper proposes a method to detect LP(License Plate) of vehicles in indoor and outdoor parking lots. In restricted environment, there are many conventional methods for detecting LP. But, it is difficult to detect LP in natural and complex scenes with background clutters because several patterns similar with text or LP always exist in complicated backgrounds. To verify the performance of LP text detection in natural images, we apply MB-LGP feature by combining with ensemble machine learning algorithm in purpose of selecting optimal features of small number in huge pool. The feature selection is performed by adaptive boosting algorithm that shows great performance in minimum false positive detection ratio and in computing time when combined with cascade approach. MSER is used to provide initial text regions of vehicle LP. Throughout the experiment using real images, the proposed method functions robustly extracting LP in natural scene as well as the controlled environment.

대학생의 심폐소생술에 대한 교육경험에 따른 지식 - 일 광역시를 중심으로 - (Knowledge According to Learning Experiences of CPR for Health Occupation College Students)

  • 엄동춘;전명희;황지영;최지예
    • 한국간호교육학회지
    • /
    • 제14권1호
    • /
    • pp.138-146
    • /
    • 2008
  • Purpose: The first responder's role during a cardiac arrest scene is to initiate CPR. The AHA has recognized and included the first responder's role for improving the survival rate of cardiac arrest patients. Health personnel working in nursing, emergency care, dental hygiene, radiology, and ocular optics frequently confront sudden cardiac arrest while working. This study was to identify the relationship between the educational experience and recognition with the level of knowledge about CPR for college students. Method: Five hundred forty college students enrolled in the department of nursing science, radiological technology, ocular optics, emergency medical technician, or dental hygiene in Daejeon city were surveyed. The tool used was CPR knowledge developed by the authors based on a literature review including 2005 AHA's CPR guideline. Result: The higher educational experience of CPR was, the higher the level of knowledge. The knowledge of the students in nursing or emergency medical technician was higher than students in dental hygiene, radiology, and ocular optics. Conclusion: CPR class should be included in the curriculum for college students in order to improve their accuracy as a first responder to cardiac arresting patients.

실감미디어 기반의 다감각 가상 체감시스템 개발에 관한 연구 (A Study on the Development of Multi-sensory Virtual Reality System based on Realistic Media)

  • 이현철;박기창;김은석;허기택
    • 한국멀티미디어학회논문지
    • /
    • 제20권9호
    • /
    • pp.1574-1583
    • /
    • 2017
  • This paper proposes how to develop a multi-sensory virtual reality system based on realistic media that can improve the sense of immersion and reality experienced by the user. We suggest four types of multi-sensory virtual reality system; a realistic media experience system which provides sensory experiences to user by interlocking the media file with the sensory informations and reproducing the sensory information suitable for the scene, a real image-based panorama experience system which maximizes the sense of reality, an experience ball system in which users engage themselves into the system environment to lead the story and immersion of the content through interaction with the system, and a cultural heritage experience system based on hand movement recognition. The suggested systems can be applied in a various area such as education, advertisement, culture and arts, performance, exhibition, sports, game, 4D Experience Center, and so on. We supposed that it can contribute to create a variety of sensible contents services in the realistic media industry through the convergence of media, contents, and devices.

Label Restoration Using Biquadratic Transformation

  • Le, Huy Phat;Nguyen, Toan Dinh;Lee, Guee-Sang
    • International Journal of Contents
    • /
    • 제6권1호
    • /
    • pp.6-11
    • /
    • 2010
  • Recently, there has been research to use portable digital camera to recognize objects in natural scene images, including labels or marks on a cylindrical surface. In many cases, text or logo in a label can be distorted by a structural movement of the object on which the label resides. Since the distortion in the label can degrade the performance of object recognition, the label should be rectified or restored from deformations. In this paper, a new method for label detection and restoration in digital images is presented. In the detection phase, the Hough transform is employed to detect two vertical boundaries of the label, and a horizontal edge profile is analyzed to detect upper-side and lower-side boundaries of the label. Then, the biquadratic transformation is used to restore the rectangular shape of the label. The proposed algorithm performs restoration of 3D objects in a 2D space, and it requires neither an auxiliary hardware such as 3D camera to construct 3D models nor a multi-camera to capture objects in different views. Experimental results demonstrate the effectiveness of the proposed method.