Search | Korea Science

Human-Computer Interaction Based Only on Auditory and Visual Information

Sha, Hui;Agah, Arvin
- Transactions on Control, Automation and Systems Engineering
- v.2 no.4
- pp.285-297
- 2000
One of the research objectives in the area of multimedia human-computer interaction is the application of artificial intelligence and robotics technologies to the development of computer interfaces. This involves utilizing many forms of media, integrating speech input, natural language, graphics, hand pointing gestures, and other methods for interactive dialogues. Although current human-computer communication methods include computer keyboards, mice, and other traditional devices, the two basic ways by which people communicate with each other are voice and gesture. This paper reports on research focusing on the development of an intelligent multimedia interface system modeled on the manner in which people communicate. This work explores the interaction between humans and computers based only on the processing of speech (words uttered by the person) and the processing of images (hand pointing gestures). The purpose of the interface is to control a pan/tilt camera to point it to a location specified by the user through the utterance of words and pointing of the hand. The system utilizes another, stationary camera to capture images of the user's hand and a microphone to capture the user's words. Upon processing of the images and sounds, the system responds by pointing the camera. Initially, the interface uses hand pointing to locate the general position to which the user is referring; the interface then uses voice commands provided by the user to fine-tune the location and to change the zoom of the camera, if requested. The image of the location is captured by the pan/tilt camera and sent to a color TV monitor to be displayed. This type of system has applications in teleconferencing and other remote operations, where the system must respond to the user's commands in a manner similar to how the user would communicate with another person.
The advantage of this approach is the elimination of the traditional input devices that the user must utilize in order to control a pan/tilt camera, replacing them with more "natural" means of interaction. A number of experiments were performed to evaluate the interface system with respect to its accuracy, efficiency, reliability, and limitations.

Gesture Control Gaming for Motoric Post-Stroke Rehabilitation

Andi Bese Firdausiah Mansur
- International Journal of Computer Science & Network Security
- v.23 no.10
- pp.37-43
- 2023
The hospital situation, timing, and patient restrictions have become obstacles to an optimum therapy session. Hospital crowding can lead to tight schedules and shorter therapy periods. This places post-stroke patients in a dilemma, since they need regular treatment to recover their nervous system. In this work, we propose an in-house, uncomplicated serious game system that can be used for physical therapy. The Kinect camera is used to capture the depth image stream of a human skeleton. The user can then use hand gestures to control the game. Voice recognition is deployed to make play easier. Users must complete the given challenge to obtain a more significant outcome from this therapy system. Subjects use their upper limbs and hands to capture 3D objects with different speeds and positions. For the more substantial challenges, the speed is increased and the locations are randomized. Each delegated entity raises the score. The scores are then evaluated to correlate them with therapy progress. Users are delighted with the system and eager to use it as their daily exercise. The experimental studies show a comparison between score and difficulty that represents the characteristics of the user and the game. Users tend to adapt quickly to the easy and medium levels, while the high level requires better focus and proper hand-eye synchronization to capture the 3D objects. The statistical analysis of the usability test at a significance level of α = 0.05 shows that the proposed game is accessible, even without specialized training. It is suitable not only for therapy but also for fitness, because it can be used for body exercise. The result of the experiment is very satisfying. Most users enjoy the game and familiarize themselves with it quickly. The evaluation study demonstrates user satisfaction and perception during testing. Future work on the proposed serious game might involve haptic devices to stimulate the users' physical sensation.
https://doi.org/10.22937/IJCSNS.2023.23.10.5

Development of 3-D Stereo PIV (3차원 스테레오 PIV 개발)

Kim Mi-Young;Choi Jang-Woon;Nam Koo-Man;Lee Young-Ho
- 한국가시화정보학회:학술대회논문집
- 2002.11a
- pp.19-22
- 2002
A process of 3-D particle image velocimetry, called here '3-D stereo PIV', was developed for the measurement of a section field of 3-D complex flows. The present method includes modeling of the cameras by a calibrator based on the homogeneous coordinate system, transformation of the oblique-angled image to a transformed image, identification of 2-D velocity vectors by the 2-D cross-correlation equation, stereo matching of the 2-D velocity vectors of two cameras, accurate calculation of 3-D velocity vectors by the homogeneous coordinate system, and finally 3-D animation as post-processing. In principle, as only two frame images are necessary for a single instantaneous analysis of a section field of 3-D flow, more effective vectors are obtainable, in contrast to the previous multi-frame vector algorithm. An experimental system was also used for the application of the proposed method. Three analog CCD cameras and an Argon-Ion laser (300 mW) for illumination were adopted to capture the wake flow behind a bluff obstacle.
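The 2-D cross-correlation step named in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the tiny synthetic windows, and the use of an unnormalized direct correlation over integer shifts are all assumptions chosen for illustration. The shift that maximizes the correlation between two interrogation windows is taken as the particle displacement between frames.

```python
def cross_correlation_peak(win_a, win_b, max_shift):
    """Return the integer shift (dy, dx) that maximizes the direct,
    unnormalized cross-correlation of two equally sized windows."""
    rows, cols = len(win_a), len(win_a[0])
    best_score, best_shift = float("-inf"), (0, 0)
    for dy in range(-max_shift, max_shift + 1):
        for dx in range(-max_shift, max_shift + 1):
            score = 0.0
            for y in range(rows):
                for x in range(cols):
                    y2, x2 = y + dy, x + dx
                    # Only pixels that stay inside the window contribute.
                    if 0 <= y2 < rows and 0 <= x2 < cols:
                        score += win_a[y][x] * win_b[y2][x2]
            if score > best_score:
                best_score, best_shift = score, (dy, dx)
    return best_shift


def make_window(size, particles):
    """Build a hypothetical size x size window with unit-intensity particles."""
    window = [[0.0] * size for _ in range(size)]
    for y, x in particles:
        window[y][x] = 1.0
    return window


# Synthetic data (made up): every particle moves by (+1 row, +2 columns).
frame_a = make_window(8, [(2, 2), (4, 5), (5, 1)])
frame_b = make_window(8, [(3, 4), (5, 7), (6, 3)])
shift = cross_correlation_peak(frame_a, frame_b, max_shift=3)
print(shift)  # (1, 2)
```

Dividing the shift by the time between the two frames would give a 2-D velocity vector in pixel units. Practical PIV codes typically use FFT-based correlation with sub-pixel peak fitting, which this sketch omits.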

Monosyllable Speech Recognition through Facial Movement Analysis (안면 움직임 분석을 통한 단음절 음성인식)

Kang, Dong-Won;Seo, Jeong-Woo;Choi, Jin-Seung;Choi, Jae-Bong;Tack, Gye-Rae
- The Transactions of The Korean Institute of Electrical Engineers
- v.63 no.6
- pp.813-819
- 2014
The purpose of this study was to extract accurate parameters of facial movement features using a 3-D motion capture system for speech recognition technology based on lip-reading. Instead of using features obtained from traditional camera images, the 3-D motion system was used to obtain quantitative data on actual facial movements and to analyze 11 variables that exhibit particular patterns, such as nose, lip, jaw, and cheek movements, in monosyllable vocalizations. Fourteen subjects, all in their 20s, were asked to vocalize 11 types of Korean vowel monosyllables three times each with 36 reflective markers on their faces. The obtained facial movement data were then converted into 11 parameters and presented as patterns for each monosyllable vocalization. The parameter patterns were learned and recognized for each monosyllable using a speech recognition algorithm based on a Hidden Markov Model (HMM) and the Viterbi algorithm. The recognition accuracy for the 11 monosyllables was 97.2%, which suggests the possibility of voice recognition of the Korean language through quantitative facial movement analysis.
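For readers unfamiliar with the Viterbi algorithm mentioned in this abstract, the following is a minimal sketch of Viterbi decoding for a discrete HMM. The two mouth states, the quantized observations, and all probabilities are hypothetical values chosen for illustration; the abstract does not specify the model structure or parameters used in the study.

```python
import math


def viterbi(observations, states, start_p, trans_p, emit_p):
    """Return the most likely state path and its log-probability.
    All probabilities are assumed to be strictly positive."""
    best = [{s: math.log(start_p[s]) + math.log(emit_p[s][observations[0]])
             for s in states}]
    back = [{}]
    for t in range(1, len(observations)):
        best.append({})
        back.append({})
        for s in states:
            # Pick the predecessor that maximizes the path score into s.
            prev = max(states,
                       key=lambda p: best[t - 1][p] + math.log(trans_p[p][s]))
            best[t][s] = (best[t - 1][prev] + math.log(trans_p[prev][s])
                          + math.log(emit_p[s][observations[t]]))
            back[t][s] = prev
    last = max(states, key=lambda s: best[-1][s])
    path = [last]
    for t in range(len(observations) - 1, 0, -1):
        path.append(back[t][path[-1]])
    path.reverse()
    return path, best[-1][last]


# Hypothetical model: hidden mouth state, observed quantized lip opening.
states = ("closed", "open")
start_p = {"closed": 0.8, "open": 0.2}
trans_p = {"closed": {"closed": 0.6, "open": 0.4},
           "open": {"closed": 0.3, "open": 0.7}}
emit_p = {"closed": {"small": 0.9, "large": 0.1},
          "open": {"small": 0.2, "large": 0.8}}

path, log_prob = viterbi(["small", "large", "large"],
                         states, start_p, trans_p, emit_p)
print(path)  # ['closed', 'open', 'open']
```

In a recognizer of this general kind, one HMM is usually trained per class (here, per monosyllable), and an input sequence is assigned to the model that scores it highest. Whether the study follows exactly that scheme is not stated in the abstract.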
https://doi.org/10.5370/KIEE.2014.63.6.813

3-Dimensional Micro Solder Ball Inspection Using LED Reflection Image

Kim, Jee Hong
- International journal of advanced smart convergence
- v.8 no.3
- pp.39-45
- 2019
This paper presents an optical technique for the three-dimensional (3D) shape inspection of micro solder balls used in ball-grid array (BGA) packaging. The proposed technique uses an optical source composed of spatially arranged light-emitting diodes (LEDs), and the results are derived based on the specular reflection characteristics of the micro solder balls for BGA. A vision system comprising a camera and LEDs is designed to capture the reflected images of multiple solder balls arranged arbitrarily on a tray, and the locations of the LED point-light-source reflections in each ball are determined via image processing for shape inspection. The proposed methodology aims to determine the presence of defects in 3D BGA shape using the statistical information of the relative positions of multiple BGA balls included in the image. The presence of BGA balls with large deviations in relative position implies inconsistencies in their shape. Experiments were conducted to verify that the proposed method could be applied to inspection without a sophisticated mechanism or productivity problems.
https://doi.org/10.7236/IJASC.2019.8.3.39

Development of Automatic Visual Inspection for the Defect of Compact Camera Module

Ko, Kuk-Won;Lee, Yu-Jin;Choi, Byung-Wook;Kim, Johng-Hyung
- 제어로봇시스템학회:학술대회논문집
- 2005.06a
- pp.2414-2417
- 2005
The Compact Camera Module (CCM) is widely used in PDAs, cellular phones, and PC web cameras. With its greatly increasing use in mobile applications, there has been considerable demand for high-speed production of CCMs. The major burden in CCM production is the assembly of the lens module onto a CCD or CMOS packaged circuit board. After the module is assembled, the CCM is inspected. In this paper, we developed an image capture board for CCMs and an image processing algorithm to inspect defects in captured images of assembled CCMs. The performance of the developed inspection system and its algorithm was tested on samples of 10000 CCMs. Experimental results reveal that the proposed system can focus the lens of a CCM within 5 s and can recognize various types of CCM defects with good accuracy and high speed.

LED transceivers with beehive-shaped reflector for visible light communication

Sohn, Kyung-Rak;Kim, Min-Soo
- Journal of Advanced Marine Engineering and Technology
- v.38 no.2
- pp.169-174
- 2014
This paper proposes a novel beehive-shaped reflector for application to light-emitting diode (LED) transceivers for illumination and bi-directional visible light communication (VLC). By using a diffuse propagation model extended to line-of-sight and direct signals, the distribution of illuminance and the path loss of the transceiver are investigated to evaluate the performance of the beehive-shaped reflector. To verify bi-directional communication, a VLC-based image capture system, comprising a complementary metal-oxide semiconductor (CMOS) image sensor and video processor unit, is demonstrated. Real-time images captured by the CMOS camera are successfully transmitted to the monitoring system via a free-space channel at a rate of 115.2 kbps.
https://doi.org/10.5916/jkosme.2014.38.2.169

Forest Fire Detection System using Drone Streaming Images (드론 스트리밍 영상 이미지 분석을 통한 실시간 산불 탐지 시스템)

Yoosin Kim
- Journal of Advanced Navigation Technology
- v.27 no.5
- pp.685-689
- 2023
The system proposed in this study aims to detect forest fires in real-time stream data received from a drone camera. Recently, the number of wildfires has been increasing, and large-scale wildfires are becoming more and more frequent. To prevent forest fire damage, many experiments using drone cameras and vision analysis are actively being conducted; however, there are many challenges, such as network speed, pre-processing, and model performance, in detecting forest fires from the real-time streaming data of a flying drone. Therefore, this study applied image data processing to capture five good image frames for vision analysis from the whole streaming data and then developed an object detection model based on YOLO_v2. As a result, the classification model reached up to 93% accuracy on forest fire images, and the field test for model verification detected forest fires with about 70% accuracy.
https://doi.org/10.12673/jant.2023.27.5.685

A Similarity Ranking Algorithm for Image Databases (이미지 데이터베이스 유사도 순위 매김 알고리즘)

Cha, Guang-Ho
- Journal of KIISE:Databases
- v.36 no.5
- pp.366-373
- 2009
In this paper, we propose a similarity search algorithm for image databases. One of the central problems in content-based image retrieval (CBIR) is the semantic gap between the low-level features computed automatically from images and the human interpretation of image content. Many search algorithms used in CBIR have used the Minkowski metric (or $L_p$-norm) to measure similarity between image pairs. However, those functions cannot adequately capture the characteristics of the human visual system or the nonlinear relationships in contextual information. Our new search algorithm tackles this problem by employing new similarity measures and ranking strategies that reflect the nonlinearity of human perception and contextual information. Our search algorithm yields superior experimental results on a real handwritten digit image database and demonstrates its effectiveness.
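As background for the baseline this abstract criticizes, the Minkowski ($L_p$) distance between feature vectors $x$ and $y$ is $d_p(x, y) = \left(\sum_i |x_i - y_i|^p\right)^{1/p}$. The sketch below ranks a toy database by that distance. The feature vectors and image names are invented for illustration, and the paper's own similarity measures are not reproduced here.

```python
def minkowski_distance(u, v, p):
    """Minkowski (L_p) distance between two equal-length feature vectors."""
    if p < 1:
        raise ValueError("p must be >= 1 for a valid metric")
    if len(u) != len(v):
        raise ValueError("feature vectors must have the same length")
    return sum(abs(a - b) ** p for a, b in zip(u, v)) ** (1.0 / p)


def rank_by_distance(query, database, p):
    """Return database keys ordered from most to least similar to the query."""
    return sorted(database,
                  key=lambda name: minkowski_distance(query, database[name], p))


# Hypothetical 2-D feature vectors for three images.
database = {"img_a": [3.0, 3.0], "img_b": [5.0, 0.0], "img_c": [1.0, 1.0]}
query = [0.0, 0.0]

ranking_l1 = rank_by_distance(query, database, p=1)
ranking_l2 = rank_by_distance(query, database, p=2)
print(ranking_l1)  # ['img_c', 'img_b', 'img_a']
print(ranking_l2)  # ['img_c', 'img_a', 'img_b']
```

The two rankings differ only in the order of `img_a` and `img_b`, which shows that the choice of $p$ alone can change retrieval order even before any perceptual or contextual modeling is considered.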

A Non-invasive Real-time Respiratory Organ Motion Tracking System for Image Guided Radio-Therapy (IGRT를 위한 비침습적인 호흡에 의한 장기 움직임 실시간 추적시스템)

Kim, Yoon-Jong;Yoon, Uei-Joong
- Journal of Biomedical Engineering Research
- v.28 no.5
- pp.676-683
- 2007
A non-invasive respiratory-gated radiotherapy system, such as those based on external anatomic motion, gives patients better comfort during treatment than an invasive system. However, a higher correlation between the external and internal anatomic motion is required to increase the effectiveness of non-invasive respiratory-gated radiotherapy. Both invasive and non-invasive methods need to track the internal anatomy with high precision and rapid response. In particular, the non-invasive method has more difficulty tracking the target position continuously because it uses only image processing. We therefore developed a motion tracking system for non-invasive respiratory gating that accurately finds the dynamic position of internal structures such as the diaphragm and tumor. The respiratory organ motion tracking apparatus consists of an image capture board, a fluoroscopy system, and a processing computer. After the image board grabs the motion of the internal anatomy through the fluoroscopy system, the computer acquires the organ motion tracking data by image processing without any additional physical markers. The patients breathed freely, without any forced breath control or coaching, during this experiment. The developed pattern-recognition software could extract the target motion signal in real time from the acquired fluoroscopic images. The range of mean deviations between the real and acquired target positions was measured for some sample structures in an anatomical model phantom. The mean and maximum deviations between the real and acquired positions were less than 1 mm and 2 mm, respectively, with standardized movement using a moving stage and an anatomical model phantom. In real human bodies, the mean and maximum peak-to-trough distances of diaphragm motion were measured as 23.5 mm and 55.1 mm, respectively, for 13 patients. The acquired respiration profile showed that the human expiration period was longer than the inspiration period.
The above results could be applied to respiratory-gated radiotherapy.
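The mean and maximum deviation figures reported in this abstract are the kind of quantity that can be computed as in the sketch below. The position samples are made-up numbers, not data from the study, and the function name is hypothetical; the abstract does not state exactly how the deviations were aggregated.

```python
def deviation_stats(true_mm, tracked_mm):
    """Return (mean, max) absolute deviation in mm between paired samples."""
    if not true_mm or len(true_mm) != len(tracked_mm):
        raise ValueError("need two non-empty sequences of equal length")
    errors = [abs(t - m) for t, m in zip(true_mm, tracked_mm)]
    return sum(errors) / len(errors), max(errors)


# Made-up example: stage positions versus positions recovered by tracking.
stage_positions = [0.0, 5.0, 10.0, 5.0]
tracked_positions = [0.5, 4.0, 10.5, 5.0]

mean_dev, max_dev = deviation_stats(stage_positions, tracked_positions)
print(mean_dev, max_dev)  # 0.5 1.0
```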
https://doi.org/10.9718/JBER.2007.28.5.676

Search Result 254, Processing Time 0.028 seconds
