• Title/Summary/Keyword: vision limitation

Search Result 76, Processing Time 0.025 seconds

Vision-based Real-time Traffic Surveillance System (영상 기반의 실시간 교통 감시 시스템)

  • Park, Se-Hyun;Jung, Kee-Chul;Hea, Jun-Koo;Kim, Hang-Joon
    • Journal of the Korean Institute of Telematics and Electronics C
    • /
    • v.36C no.8
    • /
    • pp.62-69
    • /
    • 1999
  • This paper presents a vision-based real-time traffic surveillance system. Current research in machine vision applied to traffic is due to its potential for more powerful process, its flexibility and its lower cost. Our traffic surveillance system consists of FPA and TSM on the Internet. The FPA provides image data, vehicle speed and vehicle density for the TSM. Traffic parameters, the vehicle speed and density, are extracted over many frames using differences between the sampling points of a background image and those of incoming frames. The FPA works by just processing groups of pixels without any understanding of the image. Although this system has an accuracy limitation, it can be used in applications requiring an approximate vehicle density and vehicle speed in real-time.

  • PDF

The effect of health related characteristics on the use of information and communication technology of older adults (노인의 정보화 수준에 영향을 미치는 건강특성 연구)

  • Koo, Bon Mi;Joo, Ik Hyun
    • 한국노년학
    • /
    • v.40 no.4
    • /
    • pp.729-746
    • /
    • 2020
  • For the development and better use of information and communication technology(ICT) for promoting health and quality of life in older people, it is important to understand personal factors associated with ICT use. The aims of this study were to describe the pattern of ICT use according to health characteristics of elderly and to investigate significant factors related to ICT use among three age groups. The data of 2017 National Survey of Older Koreans was used for analysis and 10,058 of older adults aged 65 and over were included for study participants. As results, first, the young old(65~74) used more functions of ICT than the middle-old(75~84) and the old-old(85 and over). Second, ICT was more used among older people with good conditions in chronic disease, subjective health, cognition, depression, vision, hearing, walking and IADL. Third, higher prevalence of ICT use was significantly associated with age, education, household income, subjective health, depression, cognitive function, vision limitation and walking limitation. Fourth, factors influencing the use of ICT were a little different among three age groups. For the young-old(65~74), subjective health, depression, cognitive decline, walking limitation, and IADL disability were significant factors affecting on ICT use. For the middle-old(75-84), subjective health, depression, cognitive decline, vision, hearing, and walking limitation were related with their ICT use. For the old-old(85 and over), only cognitive function and depression were associated with their ICT use. Based on these results, this study suggested the need to consider multiple health conditions for developing and using ICT for older adults.

Landmark Initialization for Unscented Kalman Filter Sensor Fusion in Monocular Camera Localization

  • Hartmann, Gabriel;Huang, Fay;Klette, Reinhard
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.13 no.1
    • /
    • pp.1-11
    • /
    • 2013
  • The determination of the pose of the imaging camera is a fundamental problem in computer vision. In the monocular case, difficulties in determining the scene scale and the limitation to bearing-only measurements increase the difficulty in estimating camera pose accurately. Many mobile phones now contain inertial measurement devices, which may lend some aid to the task of determining camera pose. In this study, by means of simulation and real-world experimentation, we explore an approach to monocular camera localization that incorporates both observations of the environment and measurements from accelerometers and gyroscopes. The unscented Kalman filter was implemented for this task. Our main contribution is a novel approach to landmark initialization in a Kalman filter; we characterize the tolerance to noise that this approach allows.

THE CLINICAL STUDY OF THE OPTIC NERVE INJURY AFTER FACIAL TRAUMA (안면골 골절 후 시신경 손상에 관한 고찰)

  • Park, Je-Uk;Yoon, Kyoung-In
    • Journal of the Korean Association of Oral and Maxillofacial Surgeons
    • /
    • v.26 no.6
    • /
    • pp.677-680
    • /
    • 2000
  • Orbital injuries are common with facial trauma. Direct injuries to the globe are not rare but it can result in complications such as chemosis, subconjunctival hemorrhage and hyphema. Periorbital trauma or injuries to the extraocular muscles and blow-out fracture may result in lid edema, ecchymosis & ptosis and diplopia or limitation of ocular motion respectively. Indirect injuries to the optic nerve come up without any injuries but its complication is irreversible and severe such as loss of vision. The aim of this study is to review the literature on blindness or ptosis following facial trauma and present the cases of blindness after facial trauma and ptosis after mandibular fracture without specific clinical findings.

  • PDF

A Deep Learning Approach for Classification of Cloud Image Patches on Small Datasets

  • Phung, Van Hiep;Rhee, Eun Joo
    • Journal of information and communication convergence engineering
    • /
    • v.16 no.3
    • /
    • pp.173-178
    • /
    • 2018
  • Accurate classification of cloud images is a challenging task. Almost all the existing methods rely on hand-crafted feature extraction. Their limitation is low discriminative power. In the recent years, deep learning with convolution neural networks (CNNs), which can auto extract features, has achieved promising results in many computer vision and image understanding fields. However, deep learning approaches usually need large datasets. This paper proposes a deep learning approach for classification of cloud image patches on small datasets. First, we design a suitable deep learning model for small datasets using a CNN, and then we apply data augmentation and dropout regularization techniques to increase the generalization of the model. The experiments for the proposed approach were performed on SWIMCAT small dataset with k-fold cross-validation. The experimental results demonstrated perfect classification accuracy for most classes on every fold, and confirmed both the high accuracy and the robustness of the proposed model.

Bimodal Approach of Multi-Sensor Integration for Telematics Application (텔레매틱스 응용을 위한 다중센서통합의 이중 접근구조)

  • 김성백;이승용;최지훈;장병태;이종훈
    • Proceedings of the IEEK Conference
    • /
    • 2003.11a
    • /
    • pp.525-528
    • /
    • 2003
  • In this paper, we present a novel idea to integrate low cost Inertial Measurement Unit(IMU) and Differential Global Positioning System (DGPS) for Telematics applications. As well known, low cost IMU produces large positioning and attitude errors in very short time due to the poor quality of inertial sensor assembly. To conquer the limitation, we present a bimodal approach for integrating IMU and DGPS, taking advantage of positioning and orientation data calculated from CCD images based on photogrammetry and stereo-vision techniques. The positioning and orientation data from the photogrammetric approach are fed back into the Kalman filter to reduce and compensate IMU errors and improve the performance. Experimental results are presented to show the robustness of the proposed method that can provide accurate position and attitude information for extended period for non-aided GPS information.

  • PDF

Recent Development of Automated Strain Measurement System for Sheet Metal Parts (판재 변형률 자동측정시스템의 발전)

  • 김형종
    • Proceedings of the Korean Society for Technology of Plasticity Conference
    • /
    • 2000.04a
    • /
    • pp.129-133
    • /
    • 2000
  • It is reasonable to use the stereo vision and image processing technique to digitize 3D coordinates of grid points and to evaluate surface strains on a sheet metal parts. However this method has its intrinsic problems such as the difficulty in enhancement of bad images inevitable error due to digital image resolution of camera and frame grabber unreliability of strains and thickness evaluated from coarse grid on the corner area with large curvature and the limitation of the area that can be measured at a time. Therefore it is still hard to measure strain distribution over the entire surface of a medium,- or large-sized stamped part at a time even by using an automated strain measurement system. In this study the curvature correction algorithm based on the grid refinement and the geometry assembling algorithm based on the global error minimization (GEM) scheme are suggested. Several applications are presented to show the reliability and efficiency of these algorithms.

  • PDF

Multi-resolution Pyramid based Image Identification (다중 해상도 피라미드 기반 영상 인식자)

  • Park, Je-Ho
    • Journal of the Semiconductor & Display Technology
    • /
    • v.19 no.1
    • /
    • pp.6-10
    • /
    • 2020
  • Unlike modern photography technology, in the early days, efforts to physically compose an image with a concept similar to the current photograph have not been popular or commercially successful. The limitation of the use of images as artistic media or recordings has reached the stage of introducing the technology of image analysis to automate the function that humans recognize and judge through vision. In addition, the accuracy of the image has exceeded the human visual ability, enabling the technology that enables the step of recognizing and informing the fact that the human is not aware of it. Based on such a base, the range that can be applied through the image data in the future era can be said to be unpredictable, and the technology that targets large scale image database instead of an image is also expanding the possibilities as a new application technology. In order to identify a particular image from a massive database, different methodologies have been introduced. In this paper, we discuss image identifier production methods based on multi-resolution pyramid.

Tunnel lane-positioning system for autonomous driving cars using LED chromaticity and fuzzy logic system

  • Jeong, Jae-Hoon;Byun, Gi-Sig;Park, Kiwon
    • ETRI Journal
    • /
    • v.41 no.4
    • /
    • pp.506-514
    • /
    • 2019
  • Currently, studies on autonomous driving are being actively conducted. Vehicle positioning techniques are very important in the autonomous driving area. Currently, the global positioning system (GPS) is the most widely used technology for vehicle positioning. Although technologies such as the inertial navigation system and vision are used in combination with GPS to enhance precision, there is a limitation in measuring the lane and position in shaded areas of GPS, like tunnels. To solve such problems, this paper presents the use of LED lighting for position estimation in GPS shadow areas. This paper presents simulations in the environment of three-lane tunnels with LEDs of different color temperatures, and the results show that position estimation is possible by the analyzing chromaticity of LED lights. To improve the precision of positioning, a fuzzy logic system is added to the location function in the literature [1]. The experimental results showed that the average error was 0.0619 cm, and verify that the performance of developed position estimation system is viable compared with previous works.

Enhancing Video Storyboarding with Artificial Intelligence: An Integrated Approach Using ChatGPT and Midjourney within AiSAC

  • Sukchang Lee
    • International Journal of Advanced Culture Technology
    • /
    • v.11 no.3
    • /
    • pp.253-259
    • /
    • 2023
  • The increasing incorporation of AI in video storyboard creation has been observed recently. Traditionally, the production of storyboards requires significant time, cost, and specialized expertise. However, the integration of AI can amplify the efficiency of storyboard creation and enhance storytelling. In Korea, AiSAC stands at the forefront of AI-driven storyboard platforms, boasting the capability to generate realistic images built on open datasets foundations. Yet, a notable limitation is the difficulty in intricately conveying a director's vision within the storyboard. To address this challenge, we proposed the application of image generation features from ChatGPT and Midjourney to AiSAC. Through this research, we aimed to enhance the efficiency of storyboard production and refined the intricacy of expression, thereby facilitating advancements in the video production process.