• Title/Summary/Keyword: vision AI

Search Result 153, Processing Time 0.025 seconds

Deep Learning OCR based document processing platform and its application in financial domain (금융 특화 딥러닝 광학문자인식 기반 문서 처리 플랫폼 구축 및 금융권 내 활용)

  • Dongyoung Kim;Doohyung Kim;Myungsung Kwak;Hyunsoo Son;Dongwon Sohn;Mingi Lim;Yeji Shin;Hyeonjung Lee;Chandong Park;Mihyang Kim;Dongwon Choi
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.1
    • /
    • pp.143-174
    • /
    • 2023
  • With the development of deep learning technologies, Artificial Intelligence powered Optical Character Recognition (AI-OCR) has evolved to read multiple languages from various forms of images accurately. For the financial industry, where a large number of diverse documents are processed through manpower, the potential for using AI-OCR is great. In this study, we present a configuration and a design of an AI-OCR modality for use in the financial industry and discuss the platform construction with application cases. Since the use of financial domain data is prohibited under the Personal Information Protection Act, we developed a deep learning-based data generation approach and used it to train the AI-OCR models. The AI-OCR models are trained for image preprocessing, text recognition, and language processing and are configured as a microservice architected platform to process a broad variety of documents. We have demonstrated the AI-OCR platform by applying it to financial domain tasks of document sorting, document verification, and typing assistance The demonstrations confirm the increasing work efficiency and conveniences.

The Camera Tracking of Real-Time Moving Object on UAV Using the Color Information (컬러 정보를 이용한 무인항공기에서 실시간 이동 객체의 카메라 추적)

  • Hong, Seung-Beom
    • Journal of the Korean Society for Aviation and Aeronautics
    • /
    • v.18 no.2
    • /
    • pp.16-22
    • /
    • 2010
  • This paper proposes the real-time moving object tracking system UAV using color information. Case of object tracking, it have studied to recognizing the moving object or moving multiple objects on the fixed camera. And it has recognized the object in the complex background environment. But, this paper implements the moving object tracking system using the pan/tilt function of the camera after the object's region extraction. To do this tracking system, firstly, it detects the moving object of RGB/HSI color model and obtains the object coordination in acquired image using the compact boundary box. Secondly, the camera origin coordination aligns to object's top&left coordination in compact boundary box. And it tracks the moving object using the pan/tilt function of camera. It is implemented by the Labview 8.6 and NI Vision Builder AI of National Instrument co. It shows the good performance of camera trace in laboratory environment.

Towards a Ubiquitous Robotic Companion: Design and Implementation of Ubiquitous Robotic Service Framework

  • Ha, Young-Guk;Sohn, Joo-Chan;Cho, Young-Jo;Yoon, Hyun-Soo
    • ETRI Journal
    • /
    • v.27 no.6
    • /
    • pp.666-676
    • /
    • 2005
  • In recent years, motivated by the emergence of ubiquitous computing technologies, a new class of networked robots, ubiquitous robots, has been introduced. The Ubiquitous Robotic Companion (URC) is our conceptual vision of ubiquitous service robots that provide users with the services they need, anytime and anywhere in ubiquitous computing environments. To realize the vision of URC, one of the essential requirements for robotic systems is to support ubiquity of services: that is, a robot service must be always available even though there are changes in the service environments. Specifically robotic systems need to be automatically interoperable with sensors and devices in current service environments, rather than statically preprogrammed for them. In this paper, the design and implementation of a semantic-based ubiquitous robotic space (SemanticURS) is presented. SemanticURS enables automated integration of networked robots into ubiquitous computing environments exploiting Semantic Web Services and AI-based planning technologies.

  • PDF

The History and Future of String Quartet Performances: Examining the Possibility of Convergent Performances Employing Media and Artificial Intelligence (현악사중주 공연의 역사와 미래: 미디어와 인공지능을 활용한 융합 공연의 가능성에 대하여)

  • Eun-Ji Park
    • The Journal of the Convergence on Culture Technology
    • /
    • v.9 no.5
    • /
    • pp.697-706
    • /
    • 2023
  • This study examines the history of string quartet performances and analyzes contemporary integrated performances to propose a new performance paradigm for future audiences. It examines past developments and audience interactions, and how modern classical performance can gain a competitive edge internationally through tech integration. Building on this foundation, a future vision is proposed for Korean string quartet performances, drawing from novel performances that are interconnected with their historical context. The study concludes that modern string quartets necessitate innovative and original performance directions that can be achieved through various technological integrations.

A Study on Image Annotation Automation Process using SHAP for Defect Detection (SHAP를 이용한 이미지 어노테이션 자동화 프로세스 연구)

  • Jin Hyeong Jung;Hyun Su Sim;Yong Soo Kim
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.46 no.1
    • /
    • pp.76-83
    • /
    • 2023
  • Recently, the development of computer vision with deep learning has made object detection using images applicable to diverse fields, such as medical care, manufacturing, and transportation. The manufacturing industry is saving time and money by applying computer vision technology to detect defects or issues that may occur during the manufacturing and inspection process. Annotations of collected images and their location information are required for computer vision technology. However, manually labeling large amounts of images is time-consuming, expensive, and can vary among workers, which may affect annotation quality and cause inaccurate performance. This paper proposes a process that can automatically collect annotations and location information for images using eXplainable AI, without manual annotation. If applied to the manufacturing industry, this process is thought to save the time and cost required for image annotation collection and collect relatively high-quality annotation information.

A Study on the Web Building Assistant System Using GUI Object Detection and Large Language Model (웹 구축 보조 시스템에 대한 GUI 객체 감지 및 대규모 언어 모델 활용 연구)

  • Hyun-Cheol Jang;Hyungkuk Jang
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2024.05a
    • /
    • pp.830-833
    • /
    • 2024
  • As Large Language Models (LLM) like OpenAI's ChatGPT[1] continue to grow in popularity, new applications and services are expected to emerge. This paper introduces an experimental study on a smart web-builder application assistance system that combines Computer Vision with GUI object recognition and the ChatGPT (LLM). First of all, the research strategy employed computer vision technology in conjunction with Microsoft's "ChatGPT for Robotics: Design Principles and Model Abilities"[2] design strategy. Additionally, this research explores the capabilities of Large Language Model like ChatGPT in various application design tasks, specifically in assisting with web-builder tasks. The study examines the ability of ChatGPT to synthesize code through both directed prompts and free-form conversation strategies. The researchers also explored ChatGPT's ability to perform various tasks within the builder domain, including functions and closure loop inferences, basic logical and mathematical reasoning. Overall, this research proposes an efficient way to perform various application system tasks by combining natural language commands with computer vision technology and LLM (ChatGPT). This approach allows for user interaction through natural language commands while building applications.

Cases of health care services for the elderly using IT technology and future development directions (IT 기술을 활용한 노인돌봄서비스 사례 및 개발 동향)

  • Kim, Han-byeol;Kim, Ji-hong;Lee, Sung-mo;Choi, Hun
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2022.10a
    • /
    • pp.496-498
    • /
    • 2022
  • With the prolonged spread of the new coronavirus infection worldwide and the entry of the super-aged society, smart health care, which combines IT technology for senior health care and the health care industry, is emerging as a solution to the aging problem. The development of non-face-to-face care services using Ai is on a global trend, not in some countries, and the form of care services for the elderly using AI artificial intelligence technology is changing rapidly. The convenience of AI-based care services for the elderly is expected to be highlighted, and the technology and market are expected to develop significantly. As the number of single-person households is increasing, the shortage of welfare workers for the elderly is emerging as a social issue. It is presented as a vision to solve long-term social problems such as the labor shortage of elderly care workers as well as the advantages of convenient care services using IT technology. Therefore, we would like to propose the development direction of care services for the elderly as a case study of care services for the elderly and a countermeasure against the super-aging age.

  • PDF

Development of AI and IoT-based smart farm pest prediction system: Research on application of YOLOv5 and Isolation Forest models (AI 및 IoT 기반 스마트팜 병충해 예측시스템 개발: YOLOv5 및 Isolation Forest 모델 적용 연구)

  • Mi-Kyoung Park;Hyun Sim
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.19 no.4
    • /
    • pp.771-780
    • /
    • 2024
  • In this study, we implemented a real-time pest detection and prediction system for a strawberry farm using a computer vision model based on the YOLOv5 architecture and an Isolation Forest Classifier. The model performance evaluation showed that the YOLOv5 model achieved a mean average precision (mAP 0.5) of 78.7%, an accuracy of 92.8%, a recall of 90.0%, and an F1-score of 76%, indicating high predictive performance. This system was designed to be applicable not only to strawberry farms but also to other crops and various environments. Based on data collected from a tomato farm, a new AI model was trained, resulting in a prediction accuracy of over 85% for major diseases such as late blight and yellow leaf curl virus. Compared to the previous model, this represented an improvement of more than 10% in prediction accuracy.

Trends of Plant Image Processing Technology (이미지 기반의 식물 인식 기술 동향)

  • Yoon, Y.C.;Sang, J.H.;Park, S.M.
    • Electronics and Telecommunications Trends
    • /
    • v.33 no.4
    • /
    • pp.54-60
    • /
    • 2018
  • In this paper, we analyze the trends of deep-learning based plant data processing technologies. In recent years, the deep-learning technology has been widely applied to various AI tasks, such as vision (image classification, image segmentation, and so on) and natural language processing because it shows a higher performance on such tasks. The deep-leaning method is also applied to plant data processing tasks and shows a significant performance. We analyze and show how the deep-learning method is applied to plant data processing tasks and related industries.

Pattern 인식을 위한 Neural Network

  • Kim, Myeong-Won;Lee, Gwang-Lo
    • ETRI Journal
    • /
    • v.11 no.1
    • /
    • pp.41-58
    • /
    • 1989
  • Neural network연구는 뇌로부터 얻은 아이디어를 공학적으로 응용하려는 생각을 바탕으로 뇌의 구조와 유사한 mechanism에 의한 정보처리장치의 기초가 되는 정보처리의 양식 확립과 함께 그 정보처리 양식을 구체적으로 각각의 정보처리 문제에 응용하기 위한 응용기술을 연구하는 것이다. Neural network의 계산 기능적 특성은 병렬처리, 학습 및 noisy한 정보의 효율적처리 등으로써 특히 pattern인식 문제에 효율적으로 응용될 수 있다. 본 논문에서는 neural network의 역사적 고찰과 기존의 model들을 살펴보고 새로운 계산 구조와 계산 방식을 가진 neural network의 응용분야를 살펴 봄으로써 기존의 AI 기법으로 해결하기 어려운 pattern recognition(image,문자,speech등), robot vision 및 control 등 여러가지 문제에 효율적으로 적용가능함과 neural network의 앞으로의 전망에 대하여 기술한다.

  • PDF