• Title/Summary/Keyword: 컴퓨터 비전 기술

Search Result 409, Processing Time 0.024 seconds

A Study on Mobile-based Obstacle Detection for Blinds (시각장애인을 위한 모바일 기반 장애물 탐지 연구)

  • Cho, Su-Hyeong;Kim, Ho-Jin;Park, Sang-Sun;Choi, Yu-Jun;Lee, Soowon
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2021.05a
    • /
    • pp.433-436
    • /
    • 2021
  • 사물 인식이란 컴퓨터에 입력되는 이미지에서 사용자가 정의한 사물들을 컴퓨터 비전 기술을 이용하여 인식하는 과정으로, 사물 인식을 이용하면 컴퓨터가 카메라를 통하여 입력되는 이미지에서 장애물 등 특정 사물의 인식 결과를 사용자에게 알려줄 수 있다. 본 논문에서는 YOLO 사물 인식 알고리즘을 이용하여 시각장애인에게 전방의 장애물을 인식하여 알려줄 수 있는 기술을 제시한다. 해당 기술은 실용성을 고려하여 모바일 환경에서 이용할 수 있으며, 서버와의 연동을 통해 실시간으로 사용자에게 사물 인식의 결과를 알려줄 수 있다.

Production of Media Art using OpenCV (OpenCV를 이용한 미디어 아트 제작)

  • Lee, MyounJae
    • Journal of the Korea Convergence Society
    • /
    • v.7 no.4
    • /
    • pp.173-180
    • /
    • 2016
  • OpenCV is a programming language used in digital image processing and computer vision. In this study, look at media arts made using OpenCV programming language and find out about the utilization possibilities. To this end, the first, look at OpenCV functions that are frequently used in media art, the examples of utilizing the functions. The second, discuss media arts using OpenCV. focused on the OpenCV functions, programming language for an production of media art. The third, analyze features of media arts using OpenCV, mainly focused on the functions and programming languages. The study may provide guidance to the artists to produce a media art using the OpenCV or programming language.

An Automatic OSD Verification Method using Computer Vision Techniques (컴퓨터 비전 기술을 이용한 OSD Menu 자동검증 기법)

  • Lee, Jin-Seok;Kang, Duek-Cheol;Cho, Yun-Seok;Kim, Ho-Joon
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2005.11a
    • /
    • pp.275-278
    • /
    • 2005
  • 본 연구는 디스플레이 제품의 개발 및 생산과정에서 OSD 메뉴문자의 오류 유무를 검사하는 과정을 컴퓨터 비전기술을 사용하여 자동화하는 방법을 제안한다. 디스플레이 제품의 OSD 메뉴는 순차적인 제어과정을 통해서 제한된 디스플레이 영역에 여러 종류의 언어와 기호를 포함하는 형태로 출력된다. 기존의 제품개발 과정에서 이러한 메뉴 항목의 정확성을 검증하는 작업은 작업자의 육안에 의한 판단과 수작업에 의해 이루어지고 있는데, 이는 반복작업에 의한 집중력 저하 및 판단착오에 의한 오류의 가능성을 내재한다. 또한 작업자가 다양한 나라의 언어에 대한 문자형태와 기호표현의 특성을 이해하여야 하고, 검증작업 자체에 따르는 부수적인 시간과 노력을 필요로 한다. 이에 본 연구에서는 디스플레이 제품의 OSD 메뉴와 같이 특수한 구조를 갖는 문서영상에 대한 논리적인 구조분석을 통해서 연속적인 문서영상을 발생시키는 작업스케쥴러를 생성하고, 작업스케쥴러에 의해 순차적으로 발생된 영상문서에 대한 전처리, OSD 메뉴의 기하학적 구조분석 및 문자영역을 추출하는 방법과, 표준패턴 구축 및 원형정합에 의한 문자의 오류를 검증하는 방법과 오류를 관리하는 기법을 제안한다.

  • PDF

Multi-lane Road Recognition Model Applying Computer Vision (컴퓨터비전을 적용한 다차선 도로 인식 모델)

  • Kim, Do-Young;Jang, Jong-Wook;Jang, Sung-Jin
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2021.10a
    • /
    • pp.317-319
    • /
    • 2021
  • In Korea, an intelligent transportation system(ITS) is established to efficiently operate traffic congestion on roads and is being used for traffic information collection and speed control systems. Currently, designated and dedicated lanes are in place to ensure traffic circulation and traffic safety, and systematic and accurate illegal vehicle crackdown systems with artificial intelligence technology are needed. In this study, we propose a vehicle number recognition model that can improve the efficiency of the traffic of designated vehicles. By applying computer vision technology, we are going to identify three-lane and four-lane multi-lane roads in real time and detect vehicle numbers by car to suggest ways to crack down on vehicles that violate the designated lane system.

  • PDF

YOLO based Optical Music Recognition and Virtual Reality Content Creation Method (YOLO 기반의 광학 음악 인식 기술 및 가상현실 콘텐츠 제작 방법)

  • Oh, Kyeongmin;Hong, Yoseop;Baek, Geonyeong;Chun, Chanjun
    • Smart Media Journal
    • /
    • v.10 no.4
    • /
    • pp.80-90
    • /
    • 2021
  • Using optical music recognition technology based on deep learning, we propose to apply the results derived to VR games. To detect the music objects in the music sheet, the deep learning model used YOLO v5, and Hough transform was employed to detect undetected objects, modifying the size of the staff. It analyzes and uses BPM, maximum number of combos, and musical notes in VR games using output result files, and prevents the backlog of notes through Object Pooling technology for resource management. In this paper, VR games can be produced with music elements derived from optical music recognition technology to expand the utilization of optical music recognition along with providing VR contents.

A Study on the Estimation of Multi-Object Social Distancing Using Stereo Vision and AlphaPose (Stereo Vision과 AlphaPose를 이용한 다중 객체 거리 추정 방법에 관한 연구)

  • Lee, Ju-Min;Bae, Hyeon-Jae;Jang, Gyu-Jin;Kim, Jin-Pyeong
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.10 no.7
    • /
    • pp.279-286
    • /
    • 2021
  • Recently, We are carrying out a policy of physical distancing of at least 1m from each other to prevent the spreading of COVID-19 disease in public places. In this paper, we propose a method for measuring distances between people in real time and an automation system that recognizes objects that are within 1 meter of each other from stereo images acquired by drones or CCTVs according to the estimated distance. A problem with existing methods used to estimate distances between multiple objects is that they do not obtain three-dimensional information of objects using only one CCTV. his is because three-dimensional information is necessary to measure distances between people when they are right next to each other or overlap in two dimensional image. Furthermore, they use only the Bounding Box information to obtain the exact coordinates of human existence. Therefore, in this paper, to obtain the exact two-dimensional coordinate value in which a person exists, we extract a person's key point to detect the location, convert it to a three-dimensional coordinate value using Stereo Vision and Camera Calibration, and estimate the Euclidean distance between people. As a result of performing an experiment for estimating the accuracy of 3D coordinates and the distance between objects (persons), the average error within 0.098m was shown in the estimation of the distance between multiple people within 1m.

Study of Methodology for Recognizing Multiple Objects (다중물체 인식 방법론에 관한 연구)

  • Lee, Hyun-Chang;Koh, Jin-Kwang
    • Journal of the Korea Society of Computer and Information
    • /
    • v.13 no.7
    • /
    • pp.51-57
    • /
    • 2008
  • In recent computer vision or robotics fields, the research area of object recognition from image using low cost web camera or other video device is performed actively. As study for this, there are various methodologies suggested to retrieve objects in robotics and vision research areas. Also, robotics is designed and manufactured to aim at doing like human being. For instance, a person perceives apples as one see apples because of previously knowing the fact that it is apple in one's mind. Like this, robotics need to store the information of any object of what the robotics see. Therefore, in this paper, we propose an methodology that we can rapidly recognize objects which is stored in object database by using SIFT (scale invariant feature transform) algorithm to get information about the object. And then we implement the methodology to enable to recognize simultaneously multiple objects in an image.

  • PDF

광학과 첨단영상이미징 기술의 '가교역할'및 고부가가치 응용기기 개발에서 '두각'

  • Park, Ji-Yeon
    • The Optical Journal
    • /
    • s.106
    • /
    • pp.36-38
    • /
    • 2006
  • (주)이즈미디어(대표ㆍ홍성철)는 컴퓨터이미징과 머신비전 소프트웨어 기반기술을 바탕으로 최근 폭발적인 성장세를 지속하고 있는 카메라폰과 관련한 렌즈모듈 검사장비를 주력으로 선보이며 시장에서 입지를 구축해 나가고 있다. 다양한 용도의 카메라를 만드는 것에서부터 평가하는 기술까지 고루 갖추고있는 이즈미디어는 첨단IT와 광학을 잇는 가교역할은 물론 다양한 고부가가치 응용광학계 개발을 통해 시너지를 바휘하고 있다.

  • PDF

Safety of Industrial Workers through the Development of Artificial Intelligence and A Study on Efficiency Improvement (인공지능의 발전을 통한 산업현장 근로자의 안전과 효율성 제고에 관한 연구)

  • Park, Gunuk
    • Proceedings of the Korean Society of Disaster Information Conference
    • /
    • 2023.11a
    • /
    • pp.123-124
    • /
    • 2023
  • 현대 산업현장에서의 생산성과 경쟁력은 안전 및 작업 효율성과 직결되어 있다. 특히, 4차 산업혁명의 중심축인 인공지능(AI) 기술의 발전이 산업현장의 작업 환경과 절차를 혁신하는 데 중요한 역할을 하고 있음이 점차 명확해지고 있다. 이 연구는 인공지능의 기술적 발전과 산업현장의 작업 안전성 및 효율성 간의 관계에 초점을 맞추어, 어떻게 AI 기술의 도입과 활용이 산업현장의 미래를 형성하고 있는지를 탐구하였다.

  • PDF

Python-based Software Education Model for Non-Computer Majors (컴퓨터 비전공자를 위한 파이썬 기반 소프트웨어 교육 모델)

  • Lee, Youngseok
    • Journal of the Korea Convergence Society
    • /
    • v.9 no.3
    • /
    • pp.73-78
    • /
    • 2018
  • Modern society has evolved to such an extent that computing technology has become an integral part of various fields, creating new and superior value to society. Education on computer literacy, including the ability to design and build software, is now becoming a universal education that must be acquired by everyone, regardless of the field of study. Many universities are imparting software education to students to improve their problem-solving ability, including to students who are not majoring in computers. However, software education contains courses that are meant for computer majors and many students encounter difficulty in learning the grammar of programming language. To solve this problem, this paper analyzes the research outcomes of the existing software education model and proposes a Python-based software education model for students who are not majoring in computer science. Along with a Python-based software education model, this paper proposed a curriculum that can be applied during one semester, including learning procedures, and teaching strategies. This curriculum was applied to a liberal arts class and a meaningful result was derived. If the proposed software education model is applied, the students will be interested in the computer literacy class and improve their computational thinking and problem-solving ability.